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Preface 


The reader of Book 1 has already been introduced to simple functions, to 
their derivatives and integrals, to elementary sequences and series, to the 
foundations of probability and statistics, to vectors and mechanics. Our 
present task is to build upon this base. 

Whatever we might have achieved has been due to joint authorship, 
chapters 10 , IS and 14 having been written by F. J. Budden and chapter 12 
by D. Knighton, whilst I,. K. Turner has been responsible for the remain¬ 
ing chapters and for general editorship. As in Book I, the treatment within 
each chapter follows a natural progression and the individual teacher or 
reader can exercise very great freedom in deciding how the chapters 
themselves are to be combined. 

Finally, it is a pleasure to thank Mr Michael Spincer and all his colleagues 
at Longmans for their constant support and consideration throughout out¬ 
work, Oxford Illustrators for the quality of their diagrams and the staff of 
William Clowes for the customary excellence of their printing. 

L.K.T. 

L),K, 


VH 






Notation 


Vectors 


AB, CD.... 

a.b - 

displacement vectors 
free vectors, or position vectors, 
with moduli a , i, c, ** * 

a,h.... 

unit vectors in the directions 

a, 6- 

a + b, a — 6 
ij.k 

vector sum and difference 
unit vectors, with right-hand set of 
mutually perpendicular axes 

r — xi + yj + zk 
dr 

v — — — r 
d t 

dv 

a — — — r 

dt 

MQ) = i (PQ) 

*p(Q) -^wp(Q) 

a) — 6 

(b — 0 

aa 4- fib 
r — — - 

a + 0 

1 " 

p s -Y r t 

n , 

vector r with components x r y , z 

velocity 

acceleration 

velocity of Q relative to P 

acceleration of Q relative to P 

angular velocity 
angu l ar acc deration 

r divides a, b in the ratio /jf;oc 

centroid of r ( (i — 
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NOTATION 


T = 


n 

£ m ' r ' 

j 

n 



centre of mass 

of m i at r,(i = I,.. n) 


a.b — ah cos 0 scalar product of a, b 

— a x b | + a 2 b 2 + 

a x b = (ab sin 0)n vector product of a* b 

— a ~ a %b 2 )i 

+ (a 3 b 1 - a t b 3 )j 

+ i a \h ~ a t b x )k 


Mechanics 

F force F 




resultant force R - P -f Q 


g 

F, N> R 



mv 

e 


W = F.s 
W = \ F.ds 


a c cel era uon due to gravity 
(g sz 9.8 m s" 2 on Earth) 
frictional component {F) and 
normal component {N) of 
resultant reaction (/?) 


coefficient of friction 



angle of friction (= tan 1 fi) 


impulse of constant force F in time t 


impulse of force F in interval ( t x , £ a ) 

momentum of mass m moving with 
velocity v 

coefficient of restitution (or resilience) 

work done by constant force F in 
displacement s 


work done by variable force F 


^mv 


2 


P = — = F.v 
d t 


kinetic energy of mass m moving 
with velocity v 

power of force F 

(= rate of doing work) 


k (T= kx) 



stiffness of a spring 

elastic modulus of spring or string 
of natural length / 





NOTATION 


XI 


X = ~w 2 x 
X 2 = w 2 (a 2 - x 2 ) 
x = a sin (wt + e) 

G = rxF 

/ - I m 2 

G(= 6.67 x iO" M in SI) 


standard equations of simple 

harmonic motion with amplitude 
a and period 2 n{a> 
moment about O of a force F acting 
at point r 
moment of inertia 
constant of gravitation 


Logarithmic, exponential, and hyperbolic functions 


C x d t 

In A' = log* x = I — {x > 0) 

e (as 2 . 718 ) 

c* {or exp x) 

sinh a- = ^(e x - r _x ) 

cosh x — + e~ x ) 

sinh -1 x = In [x + v/(x 2 + 1)] 

cosh -1 x = In [x + yj{x 2 — l)] 


natural (or Napcrian: logarithmic 
function 

base of natural logarithms (In* = I) 

exponential function 

hyperbolic sine 

hyperbolic cosine 

inverse hyperbolic functions 


Coefficients in binomial series 



n{n — 1) (n — 2).,. (n — r + 1) 

7 \~ 


= "CX 


(«eireZ + ) 

(neZ+) 


Complex numbers 
z — x 4 - iy 

— r{cos 0 + i sin 0) = r eis 0 
R{z) = x 
/(z) = iy 

M = r = V (* 2 + '/) 

arg z = 0 (— n < 0 < tt) 
z* — x — iy — r cis (— 0) 


complex number z 

real part of z 
imaginary part of z 
modulus of 2 
argument of z 
complex conjugate of z 


Probability and statistics 

n = £M = H Ps, 


o 2 = n*\ = m* - p ) 2 1 
= £[* 2 ] - / 
= I Ps? - p 2 


G{t) = up/ 


mean or expected value of random 
variable x, distributed with 
probability p 

variance of random variable x 


standard deviation 
p ro ba b i I ity g en era t o r 




XJ 1 


NOTATION 


H = G'(l) 

= G"{\) + G'( 1) - [■(''(l)] 2 

t* = np 
<i l — npq 

<&{x) 


4 >(x) = ®'(*) 





<£(*) d* 


mean and variance of binomial 
distribution 

cumulative probability * or 
distribution, fu notion 
probability density function 


fi 


a 


2 



<t>(x) = 
</>(.v) = 


x<f)(x} dx 

(* - n) 2 <t>(x) dx 

x 2 <p(x) d x — fi 2 

_ * --tx-vWia 1 

>/£* 


Pr = 


T ! 


e 


mean and variance of 

continuous distribution with 

p d <P(x) 


normal distribution with mean 
and standard deviation a 

normal distribution with mean 0 
and standard deviation l 

Poisson distribution with mean // 


£[m] = fi 

V\m\ = — 

n 

a 

V* 


expected value, variance and 
standard deviation of distribution of 
mean m of sample size n (drawn 
from population with mean pt and 
standard deviation tj) 



For a bivariate distribution (x h y t ): 



n n 


.2 = I (*t - *) 2 _ Ljf _ _ a 


expected value of variance s 1 of 
sample size n 

unbiassed estimate for standard 
cIeviation of a popu 1 ation when 
s am pie s i z e n h as stan d a rd 
deviation s 


means of y 


n 


n 


variance of * 












NOTATION 


-2 _ I [h - y ) 2 M 

>■ y 

n rc 

L(*i -*)(?( -F) 




A ?/ 


.2 


^iaL 


Vy 

„ _ . _ J> E<ff 

P ~ n{n 2 - 1) 


T = 


jn{n — 1) 


xiii 


variance of y 
covariance of x, y 


regression coefficient of y on x 

regression coefficient of x on y 

ro rre 1 a i io 11 coe Hi c i en t 

Spearman's rank correlation 
coefficient 

Kendall’s rank correlation 
coefficient 


Matrices and determinants 

A. B. C - 

det A or \A | 

A\B\ CV.. 

A*. B*. C*, 

\ <T 


Sets 

Z 

Q 

R 

C 

Z\Q\R + 
Z*,Q*, R* 


matrices 

determinant of square matrix A 
transposed matrices 
adjoint matrices 
inverse matrices 
cofactors of a tj in del A 
product matrix 


integers 

rational numbers 
real numbers 
complex numbers 
positive integers etc, 

Z. Q> R y but omitting zero 


Groups 

G, * group formed by elements of G 

under binary operation * 

C w cyclic group of order n 


D u 


dihedral group of order 2 n (group 
of reflections and rotations of 
regular n --sided polygon) 












XIV 


NOTATION 


symmetric (permutation) group of 
order n 

is isomorphic to 


Boolean Algebra 

A + B(= Au B} 
AB (= A nB) 

<=, id 

0 


A' 


union of sets A , B 
intersection of sets A f B 
inclusion {A <= B or A id B) 
null (or empty) set 
universal set 
complementary set 


Throughout the book, and in obedience to the recommendations of the 
British Standards Institution and the Royal Society (though contrary to 
the preference of FJ.B.), the aligned dot denotes a decimal point and not 
multiplication: so that 2 . 3 , for instance, means 2 -^ rather than 6. 




Further calculus 


9.1 The logarithmic function, In x 

When we first considered integration, we found that 

f t " +1 

\ f dt — -- + r, provided that n —L 

J n + 1 

When n = — 1 , however, this expression contains the meaningless term 
t° /0 . Nevertheless, the problem of finding J t~ 1 d^ is certainly not meaning¬ 
less, and will arise whenever (which is frequently) we need to know the 
area beneath a section of the curve f(t) = \/t. But the reader will probably 
find that, despite all his efforts, he fails to discover a function whose deriva¬ 
tive is \/t, and must therefore return to a more careful examination of the 
area which this integral represents. 

We start by considering the area beneath the curved(/) — \jt between 
t = 1 and t = x, which we call / (#). 

mi 
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FURTHER CALCULUS 


So / (*) = J — d£, which is defined for all x > 0. 

Clearly, if x > 1 , /(tf)> 0 ; 

if 0 < x < 1, l(x) < 0; 

and /(l) = 0. 

We can now estimate / (*) for any positive value of x by means of counting 
squares or by Simpson’s rule. 

Exercise 9.1a 

1 Count squares or use Simpson’s rule (with four intervals in each case) 
to estimate the values of: 

(?)/(2); (ti) /(3); (Hi) 1(6); (iv) 1(9); (v) l( 12); (a)/(i); (vii) /(j). 


Properties of l (x) 

After this last exercise, the reader may have suspected that 

1(6) = 1(2) + 1(3), 1(12) = 1(2) + 1(6), 

l(3 2 ) = 2 /( 3 ), l(2~ l ) = -1(2). 

More generally, we shall now prove that 
(i) l(xy) = l(x) + l(y), 
and (ii) l(x n ) = nl(x). 

(i) By definition, 

f* i fy i 

l(x) + l(y) = -d/+ -dt, 

it i t 

1 4 / 1 

and we can transform the second integral by changing the variable. 
Regarding x as constant, we let u = xt so that d u = x d£. 

Also t = 1 => u = x, and t = y => u = xy. 

C y 1 C xy 1 

So I — df = I - d u 

1 t v U 


(as the letters t and u are themselves irrelevant). 

m x y C*y i 

Hence i(x) + l(y) = - d^ + -dt 

it Jr ‘ 


l(x) + l (y) = l(xy). 
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This substitution can be illustrated by considering the shaded area 
between t = 1 and t = y and squeezing it, like toothpaste, into the area 
between t = x and t = xy: horizontal dimensions are stretched by a 
factor x and vertical dimensions compressed by a factor 1/x, so that areas 
remain unchanged: 



Hence /(x) + l(y) = / (xy). 


(it) Similarly, l{x n ) 



and if we now let t = u n , it follows that d£ = nu n 1 dw. 


Also t = 1 => u = 1 and t = x n => u = x. 

C x nu n ~ l 

So l(x n ) = I —du = n 
=> l(x n ) — nl(x). 




Now as these are also properties of logarithms, the question immediately 
arises whether / (x) itself is in fact a logarithm. But first we must introduce 
the crucial number e such that 


1 (e) = 1, 

i.e., such that t — e is the ordinate up to which there is unit area beneath the 
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given curve: 



By counting squares, we can show that e » 2 . 72 , and we shall soon be 
able to calculate it to a much greater degree of accuracy. 

Using the above results, we see that 

l(e l(x) ) = /(*)/(*) = /(*)• 

So the areas under the curve to the value e 1 ^ and to the value x are equal. 
These must therefore be the same, and e Hx) = x 


l (x) = log e x 


log e x is usually called a natural, or Naperian, logarithm (after John 
Napier, 1550 - 1617 ) and is sometimes denoted without a base, simply as 
log x; but more customarily as In x. We shall also usually write 


' 1 f 1 

f J 

- d£ and - d* as 

— and 

J * J * 

t 

*/ v 


So, in summary, if x > 0 : 



= log e x = \n x 


— = In x + c — (In x) = - 
x ax x 


and the relationship between 
1 


and In x 
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can be conveniently illustrated by their graphs: 

/Mi /M A 



In particular, we notice that 

when x ^ 0, In * is not defined; 

as x —> + co, In * —> -f oo; 

as *—>() + , In * —> — oo. 

We are now able to find other areas bounded by such graphs. 


Example i 

Investigate the areas represented by 



= In 5 - In 2 


= In f 



f -2 dx 

\ n ) — represents an area below the x*axis, and our natural approach 

J -3 x 

would be similar: 
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FURTHER CALCULUS 




But the reader will rightly object that In (— 2 ) — In ( — 3 ) is completely 
meaningless, since neither In ( — 2 ) nor In ( — 3 ) has been defined. We 
can, however, easily side-step this obstacle by using the symmetry of the 
graph about the origin, observing that 



* 3 dx 

2 X 


-[In xf 2 

- (In 3 - In 2 ) 
-In 1.5 


« — 0 . 41 . 



We have already seen that In x —> — oo as x —► 0 -1-, so the area beneath 
the curve between x = 0 and x = 2 is infinite; and similarly the area to 
the left of x = 0 is infinite. 
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So dx/x is composed of two parts, both of which are infinite. It is 
therefore meaningless and without value. 

Example 2 
Differentiate: 


(i) ln5x 3 ; ( ii ) In 


x 2 + 1 


; (Hi) log 10 x. 


(i) In 5x 3 = In 5 4 - In x 3 = In 5 + 3 In x 

d , d 3 

=> —In 5x 3 = — (In 5 + 3 In x) = -• 
dx dx x 

Or — (In 5x 3 ) = — z x 15x 2 = -• 
dx 5x x 

(«) 

dx \ x 


= — [In (x 2 + 1 ) — In x] 
dx 


1 


x 2 + 1 
x 2 - 1 

x(x 2 + 1 ) 


x 2 x 


Or 

d* \ x 


x x x 2 x — (x 2 + 1 ) 
- x --- 


AT 2 + 1 
X 2 ~ 1 

x(x 2 + 1) 


(Hi) Let y — log 10 x. 
Then x = 10 y 
=> In x = y In 10 
In x 


* In 10 
dy 1 


1 


dx (In 10)x (2.302 6 )x 

Q d 0.434 3 

So — (log 10 *) «- 

dx x 


Example 3 


f a : 2 Cf'(x) C n/4 

(,) J?TT d ' ; ( “ ) J7w‘ d ' ; (S) 1 


tan x dx. 


(i) Let I = 


x 2 dx 
x 3 + 1 
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Now u = x 3 + 1 => Au = 3 * 2 Ax. 


So I 


fida 

= 2 __ = i in u 


+ C 


x 2 dx 
x 3 + 1 


(ii) Let / = 


= j In (x 3 + 1 ) + c. 


/w 


dx. 


Here again we let u — f (x) => da =/'(x) dx. 

" da 


So / = 


= In a + c 


= ln/(x) + c, 

so that we are now able to integrate any quotient whose numerator is the 
derivative of its denominator. 
frc /4 


(m) / = 


tan x dx. 


This can be expressed as 

/. r*i4,--r 

Jo cosx Jo 

So, using (ii) above, 


'«/ 4 _oi 


sin x 


dx. 


I = [ — In cos x] 


1 


tt/4 


= -|ln-^-lnl) = iln2. 


Hence 


*«/ 4 

J 0 


tan x dx = \ In 2 . 


Exercise 9.1b 

1 Given that In 2 = 0.693 15, In 3 = 1.098 61, In 5 = 1.609 44, calculate 
to 5 places of decimals: 

(t) In 4 ; (ii) In 6 ; (in) In 10 ; (iv) In 12 ; (»)ln|; 

(vi) ln^; (vii) lny^; (viii) In 10 6 ; (ix) ln^/ 2 ; (*) In 100 . 

2 Evaluate: 

(i) \ne 3 ; (ii) In 1 ; (Hi) In-; ( l '*0 l n \A- 


3 Solve the equations: 

(i) In* = 2 ; (ii) In * = 100 ; (in) In * = ys; (*») ln * = “ 2 - 
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4 Differentiate with respect to x: 

( 1 ) In 2 x; (ii) lnx 3 ; 


(iv) In-; 


( v ) In 


(vii) In (x 2 + 1 ); (viii) In 


x + 1 


(x) In XyJ(x + 1 ); (xi) x In x; 


(iii) In 3a: 4 ;* 
{vi) In 

{ix) In y/(x 2 

, In a; 
(xn) - 


5 Differentiate with respect to x : 

(z) In sin a:; (ii) In cos x; (iii) Insect; 

(iv) In (sec x + tan x ); (v) In (3 sin x ); (vi) In (sin 3x); 

(vii) In (sin 2 x ); (viii) In (4 sin 2 x ); (ix) (In*) 2 ; 

(x) In (In x). 


6 Find the following integrals: 
^ dx 
3x’ 

+ 1 


(0 
{iv) 
(vii) 


J'-T 


x 2 dx 


1 - X 


(“) 
dx; (t>) 
{viii) 


r dx 

J 

I* dx 

J 1 - 

f x - 1 

J S-2,+ 3 

(xi) f tan 3x dx ; 


x) J* cot x dx; (xi) ^ i 

... f sin x + cos x , . . . f 

xm) — - dx; (xiv) - 

J sin x — cos x J1 


sec 2 x 


+ tan x 


(iii) 
(vi) 
(ix) 
(xii) 
- dx. 


2x + 3 


1 

f 2 xdx 

J X 2 +1 ; 

(4^0*: 

J x 3 + 3x 
j' cot \x dx; 


7 Evaluate (or, if this is impossible, illustrate): 

r 


(0 
{iv) 
(vii ) 
(x) 


dx 

(«) 


*6 

dx 

X 

j 

4 

x + r 

dx 

(V) 


*3 

dx 

x - 4’ 


! 

x - 2 ’ 

dx 

(viii) 


*1 

dx 

3 - 2x’ 


0 

1 - 2 x 5 


(iii) 

{vi) 

{ix) 


dx 

2 2x ~h 3 


{%_ 

J* 


x dx 


cot 6 d6 ; 


* n cos 6 — sin 6 
0 cos 6 -f sin 0 


dd. 


8 Given that In 2 « 0.693 15, use differentiation to estimate (to 5 places 
of decimals): 

(0 In 2 . 001 ; (ii) In 1.999 9. 

9 Find the volume of the solids of revolution formed when the area 

(i) between y = 1 /^/x, x = 1 , x = 2 , and y = 0 is rotated about the 
x-axis; 
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(ii) between y = \/x 2 , y = 1, y = 4 is rotated about the y-axis. 

io By using the Newton-Raphson method, or any other suitable iterative 
method, solve the equation 

x + In x = 3, 

given that the root is close to 2. Obtain the root correct to two decimal 
places. (j.m.b.) 

n (i) Using the Newton-Raphson method or otherwise, solve the 
equation 

In x = \x, 

giving your answers to four significant figures. 

(ii) Find the gradient of the straight line through the origin which touches 
the graph of In x, and hence show that 

In x = otx 

has a solution whenever a < \/e. 

12 Boyle’s law states that, at constant temperature, the pressure p and 
volume V of a gas are such that pV = c, where c is a constant. If the work 
done by the gas when expanding is j p dV, show that this is constant 
(= c In 2) whenever it doubles its volume. 

13 If an animal grows without changing its shape then its length x, surface 
area S and volume V are such that 

S = ax 2 , V = bx 3 (where a, b are constants). 

Hence show by differentiation: (a) without use of logarithms; (b) after 
first taking logarithms, that slight increases Sx, 8S and SV are related by 

SS Sx SV Sx 

— = 2—> — = 3—> 

S X V X 

i.e., Xkv&Xfractional increases of£ (and V) are twice (and thrice) the fractional 
increase of x. 

9.2 The exponential function, e x 

We now proceed to investigate the function f (x) — e x . This is clearly a 
power function, and we start by finding its derivative. 

y = 

x = In y 
dx 1 

&y y 



Now 




9-2 THE EXPONENTIAL FUNCTION, t 


So 4 - («*) = 

dx 

and we immediately see that e x is the particular power function whose rate 
of increase is always equal to its value: 



It was mentioned earlier that power functions are sometimes referred to 
as exponential functions. Because of the supreme importance of e* it is 
usually called the exponential function, and is sometimes written exp x. 
Similarly we see that 

4-(o = * -x x -i = 



so that e~ x is the corresponding function of exponential decay. 

These two functions, and variants of them, occur whenever the rate of 
growth of a quantity, or its rate of decay, is determined by its present 
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amount. So, for instance, the spread of an infection in the first stage of an 
epidemic is likely to involve a positive exponential function, whilst the 
decay of a radioactive element is governed by a negative exponential. 


Lastly, just as 
so 



Example i 

Differentiate: (i) e * (i) 2 ; (ii) 10*. 

(i) 4- (e xl ) = e xl x 2x = 2x e x \ 

dx 

(a) 4~(m (* lnl V 

ax dx 

= i-(^lnlO) 
dx 

= ** lnl ° x In 10 
= 10* In 10. 


Alternatively, y = 10* 

=> In y — In 10* = x In 10 

y dx 

=> ^ = y In 10 

dx * 


dx 


(10*) = 10* In 10. 


Example 2 
Find the integrals 



(ii) 


l 


2 X dx. 


(i) Since — (e 2x ) = 2 e 2 *, 

dx 


it immediately follows that 



(ii) As in Example i, we can easily show that 
d 
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J 3 


So 2 X dx = -—- + c. 

In 2 


Example 3 

When a charged condenser is discharged through a particular circuit con¬ 
taining an inductance and resistance, the current x at time t obeys the 
equation 

d 2 x dx 

—y+ 2— + 5x = 0. 
dt 2 d t 

Show that x = e~ % cos 2t satisfies this equation and sketch the graph of the 
function. 

If * = e~ x cos 2 1, 

dx 

then — = e~ l x —2 sin 2 1 — cos 2 1 

dt 

= £ _t (— cos 2t — 2 sin 2 /), 

d 2 x 

and —2 = £~*(2 sin 2 ^ — 4 cos 2 f) — e~ t ( — cos 2t — 2 sin 2 f) 
dt 

= — 3 cos 2f + 4 sin 2£). 

So we verify that 

d 2 x dx 

2 ~—h 5 a: 
dt 2 dt 

= e~ t {( — 3 cos 2^ + 4 sin 2t) + 2( — cos 2t — 2 sin 2f) + 5 cos 2t} 

= 0 . 

Now x — e~ x cos 2 1 is the product of an exponential decay and a cosine 
function: 
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Combining these, we obtain the graph of x = e~ f cos 2 1 and we readily see 
that it represents a decaying oscillation. 



Example 4 

A bathful of water is at a temperature of 80 °C in a large room whose tem¬ 
perature is 20 °C. The water cools at a rate which is proportional to its 
excess temperature and takes 5 min to reach 70 °C. 

Find the temperature of the water after t min. 

If the excess temperature is 6 °C, the rate of cooling is 


d 0 

d 7 


So 


dd 
d£ 
d£ 
d 0 


— k6 , where A: is a constant, 

1 

~~ ~~k0 


=> t = — -In 6 + c. 

k 

But, when t — 0, 6 = 60. 

So 0 =-In 60 -f c 

k 


c = 7 In 60 
k 


t = — In 0 + - In 60 
k k 

- _I, 1 

k n 60 


In — = — kt. 
60 
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So far we have only used the general ‘ law of cooling ’ and the ‘ initial ’ 
condition that 9 = 60 when t = 0 . But we also know that when t = 5 , 


9 = 50. 


So 


So 


lnfg = -Ax 5 

k = -i In | = | In f « 0.036 5. 

, e 

In — = — 0.0365< 

60 

n 

_ _ — 0.0365f 

60 _ 

0 = 60 e~° 03651 . 


So the temperature of the bath after t minutes will be 
20 + 60 e _0 0365 f 



tj min 


We can now use this function to calculate the temperature of the bath at 
any subsequent time. For example: 

if£ = 10 , temperature = 20 + 60 e ~ 0 ' 365 
= 20 + 60 x 0.694 
= 20 + 41.6 = 61.6 °C; 
if£ = 20 , temperature = 20 4 - 60 e~ 0 ' 73 
= 20 + 60 x 0.482 
= 20 4 - 28.9 = 48.9 °C. 


Exercise 9.2 


1 Use tables to evaluate e x when x = 10, 100, yo, — 10. 


2 Find values of x such that 

(i) e x = 10 , 1 000 , 10 6 ; («) 


10 “ 4 , 10 " 6 . 
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3 Differentiate with respect to x: 


(*) 


(ii) 

e~* x \ 


(iii) 

x e x \ 


(iv) 

x 2 e x ; 

(v) 


(vi) 

e x 

5 

X 


(vii) 

e~ x sin x; 

(viii) 

e 2x cos 3x 

; (™) 

3*; 

(x) x x . 

4 Find the following 

integrals: 




(,) J 

e 2x dx; 


(ii) 

1 

dx; 

(Hi) 

J 

| x e x 2 dx 

(iv) 

/% 

e*' mx cos x 

dx; 

w j 

% 

x e 

-** dx; 

(vi) 

10 x dx. 


5 Evaluate the following integrals: 


*2 

f 3 r 1 , r 

e~* x dx; (ii) 

e 2x dx; (iii) xe * x dx; (iv) 

Jo 

J i Jo J c 


6 Investigate the stationary values and sketch the graphs of the following 
functions: 

(i) xe ~ x ; (ii) x 1 e~ x \ ( iii) x n e~ x ; (■ iv ) e~ x \ 


p 

7 Find e x dx. 

Jo 

By letting X-+ + oo, investigate the area which Jies between the curve 
y = e~ x and the two axes (for convenience called J q e~ x dx). 


8 After time t the displacement x of a heavily damped pendulum is given 
by x — 2 e~ l sin t. 

(i) For what values of t is the displacement zero? 

(ii) For what values of t is the displacement greatest? 

(iii) Sketch the graph of x against t. 

(iv) Prove that 


d 2 x ^ dx 
~r~2 4* 2— 
d^ 2 d^ 


+ 2x = 0. 


9 A biologist knows that a certain organism, of which there is initially 
2 g, is growing continuously at a rate of g/day for each gram of its mass. 
If its mass after t days is m g, 
dm . 


(i) show that 


At 


(ii) deduce that m = 2 e x 


,tfl o. 


(iii) by what percentage does it increase each day? 

(iv) how long does it take to double its mass? 
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io A sample of radium loses mass at a rate which is proportional to the 
amount present. 

(i) If its mass is m after time /, show that 


dm 



(«) if its initial mass (when t = 0) is m 0 , deduce that 
m = m 0 e~ kt ; 

(Hi) if its mass is halved in 1 600 years, calculate k ; 

(iv) calculate the percentage rate of decrease per century. 

ii A man with toothache leaves home at noon for the dentist, whose 
surgery is 1 km away. He starts by running at lOkmh -1 , but gradually 
slows down so that his speed is always proportional to his distance from the 
surgery. 

(■ i ) How far has he gone in the first 5 min? 

(ii) How long will he take to get within 100 m? 

(in) When will he arrive? 


9.3 Interlude: infinite series 


In Chapter 6 we investigated a number of finite series and also mentioned 
the ‘sum to infinity’ of a geometric progression. 

It was proved, for instance, that if S n is the sum of the first n terms of the 
series 


1 + i + i + in— ’ 


1 — 

then S n = —-ii'- = 2 - 


1 


1 -i 


Now as n —> 00 , 


2»»-i 

0 . 


So S„-> 2, 

and we say that the infinite series 

l + i + i + ^ + -fg + -- - 

is convergent, and that its sum (or sum to infinity ) is 2. 

More generally, we can consider the infinite series 

Wj -f w 2 + w 3 + • • • 

and suppose that the sum of its first n terms is S n , so that 
S n = u x + u 2 + u 3 + • • • + u n . 
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If S n —> S as n —> oo, we say that the infinite series is convergent , and that 
S is its sum . 

It is beyond our present scope to investigate the convergence of series, 
but we can consider some elementary examples: 

Example i 

1 + 2 + 3 + 4 + 5 H - 

Certainly the terms of this series become very small. If we look at the 
successive values of S n , we obtain (approximately): 

1, 1.500, 1.833, 2.083, 2.283, ... 

and before long it seems that the values of S n are having such a struggle to 
increase that they must necessarily tend to a limit, which will be the sum 
of the infinite series. But a little more thought will cause us to write the 
series as 

1 + 2+ (i + i) + (5 + 6 + 7 + l) + (9 + m + U“ + ' * * + Hi) d * 

>l+^-i- 2x-| + 4-Xfj + ^ X L6 + * * * 

> 1 + \ ^ T" \ + 2 + * * * 

Now this series, although growing very slowly, is completely unlimited. So, 
contrary to our expectations, the original series 

1 + 2 + 3 + i + * * * 

is also unlimited and does not converge (even though, as we shall later 
prove, the sum of its first million terms is less than 15). 

Example 2 
The series 

_L _L _L _L _L .1 J_, i_ . .. 

12 I ^2 ^ ^2 ^ ^2 ^ ^ 02 ^ y2 g2 

can be written as 

^+(^+^ + (^2 + 

< 1 + 2xi + 

< 1 + i + ? + s ■+-= 2 

In this case, therefore, the series 

1 1 1 

1 d" 7^- + 779 - + T9 + ’ ‘ ' 

2 3 2 4 

does converge to a sum less than 2 (and actually, though we cannot prove it 
here, to in 2 ). 




9*3 INTERLUDE : INFINITE SERIES ig 


Example 3 

As a warning that we must treat infinite series with a certain respect, we 
now consider the series 

1 + _ 

It can be shown that this series is convergent and also, perhaps sur¬ 
prisingly, that its sum is In 2 ~ 0.693. So it would appear to be an innocent 
operation to rearrange its terms as follows: 

1 2 ' 3 4'5 6 ' 7 8 ' 

-1 J_i_ _|_ 

“ 1 2 4'3 6 8 ^5 10 12 ’ 

= (i - i) - i + (i - i) - i + (i - *) - ■ ■ ■ 

- i _ 1 i I _ ! 1 _ 

2 4 ■ 6 8 ' 10 

= ^ ” 2 ^ 3 “ 4 + 1 — '*■)• 

But the reader will immediately see that we have ‘ proved ’ that 
In 2 = i In 2 . 

This paradox, and others like it, show that the theory of infinite series 
must be developed with very great care: though we cannot attempt this 
here, it forms a major branch of modern mathematics. 

Example 4 

We now return to the geometric series 

\ + X + X 2 + X 3 -\ - 

The reader will recall that 



and that if |#| < 1, x n —► 0 as n —► 00 . 

So if \x\ < 1 the series is convergent to the sum- 

l — x 

This is a particular case of a power series, whose terms are successive 
powers of x and which converges for a set of values of x called its domain of 
convergence. 

The general power series can be written 
a 0 + a l x + a 2 x 2 + a 3 x 3 + • • • 

where a Q , a x , a 2 ,. .. are particular constants, and we should expect such 
a series, if convergent, to have a sum that is a function of x. It will, for 
instance, be shown in the next section that 

X - jx 2 + i* 3 - }x 4 + • * • 
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is convergent to the sum In (1 + x ), provided that 1 < x ^ 1; 
and that 


2 3 v 4 

XXX 

1 + * H-{-1-+ • 

2! 3! 4! 


is convergent to the sum e x for all values of x. 


9.4 Maclaurin series 


We shall begin by reversing the question of the last section and ask whether, 
if we are given a particular function f (x), we can discover a convergent 
power series of which it is the sum. For this purpose we shall have to make a 
number of assumptions. 

Firstly we shall restrict ourselves to functions which are differentiable 
not only once, but any number of times, i.e. whose derived curves are all 
continuous and smooth. 

Secondly, we shall assume the existence of such a series for a certain 
domain of x, and our problem will be simply to find the series, granted that 
it exists. This may seem rather strange but, throughout mathematics, prov¬ 
ing such existence is frequently a major problem, and the only statement 
we shall be able to make here is that, if a particular function is expressible 
as a power series, then this series must be such and such. 

Lastly, we assume the fact that if this series exists and is convergent for a 
certain domain of x, then, within this domain, it is differentiable ‘ term by 
term’. By this we mean simply that the derivative of its sum is equal to the 
sum of its derivatives, even though there is an infinite series of them. In 
other words, that just as 


and 


d d u dv 

-(« + ,) = - + - 

d d u dv d w 

-(«+,+«.)= s +-+ Tx 


and so on for any finite series, so also it is true for our infinite series that 
d du dv dw 


Let us suppose that f (x) is a function which, throughout a certain 
domain including x = 0, 

(?) is differentiable any number of times, and 
(ii) is the sum of a convergent power series. 

Let this series be 

f(x) = a 0 + a x x + a 2 x 2 + a 3 x 3 + a 4 x 4 + • • • • 
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Differentiating term by term and putting x = 0, we obtain successively: 


f(x) = a 0 + a x x + a 2 x 2 + a 3 x 3 + a 4 x 4 + • • • 
f'(x) = + 2 a 2 x + 3 a 3 x 2 + 4 a 4 x 3 + ■ ■ ■ 

f"{x) = 2 a 2 + 6a 3 x + 12« 4 * 2 + • • • 
f"'{x) = 6 a 3 + 24« 4 a: + • • • 
f""(x) = 24 a 4 + • • ■ 


/(0) = « 0 
/'(0) = a, 

/"(0) = 2 !a 2 
/'"(0) = 3!a 3 
/""( 0) = 4!a 4 , etc. 


So a 0 =/( 0 ), =/'( 0 ), 


/"( 0 ) 
-, 


/'"( 0 ) 
-9 


/ <n) (0) 

and similarly a. = - 

n ! 


Hence 



This is known as Maclaurin’s series (after the Scots mathematician Colin 
Maclaurin, 1698-1746), which we can now illustrate by three very im¬ 
portant examples: 


Exponential series 
If f(x) = 

then /'(*) =f"(x) = ••• =/<">(*) = 

So /(0) =/'(0) = ...=/W(Q) = 1. 


Hence 


x 2 

3 4 

at jr 

x n 

e* = l + x + — 

+ — + — + ■ 1 

. . -" + * • * 

2 ! 

3! 4! 

n ! 


It can be proved that this series is valid for all values of x; as, indeed, we 
would expect, since however large x might be (say x = 1 000 ), the great 
size of x n is eventually overcome by the even greater size of n !. 

We can now use this series to calculate e to any required degree of 
accuracy. For, putting x = 1, 


11111 
* “ 1 + l! + 2! + 3! + 4! + 5! + 


and the calculation can be set out to seven places of decimals: 
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1 

1 

V. 

1 

2 ! 

1 

31 

1 

4! 

1 

5! 

1 

6! 

1 

7! 

1 

8 ! 

1 

9! 

1 

ToT 
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1.000 000 0 

2) 1.000 000 0 

3) 0.500 000 0 

4) 0.166 666 7 

5) 0.041 666 7 

6) 0.008 333 3 

7) 0.001 388 9 

8) 0.000 198 4 

9) 0.000 024 8 

10) 0.000 002 5 

0.000 000 2 
2.718 281 5 


We see that this series is rapidly convergent and that e « 2.718 281 5. 

The careful reader will note that this value of e is subject to two sources 
of error: 

(i) because individual terms have been calculated only to seven places of 
decimals. 

When, for instance, we state that — = 0.166 666 7, 

3! 

we are asserting that it lies between 0.166 666 65 and 0.166 666 75 and 
so is subject to a maximum error of 5 in its eighth decimal place, i.e., 
5 x 10~ 8 . As there are eight such terms which have been corrected in this 
way, the maximum error from this source is 40 x 10 - 8 = 4 x 10 _ 7 . 

This is known as the rounding error. 

(ii) because the series was cut short at 1 / 10 !. 

The remainder after this term is clearly 

1 1 1 

In + 12! + Tv. + '" 

l 

+ 12 X 13 


11 ! 


= t^ 1 


12 
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1 w i 12 

11! X 1 - ^2 IT x 111' 


12 

Now Ti-TTT < 2.7 x 10“ 8 . 

11 x 11! 


So the maximum error from this source is less than 2.7 x 10“ 8 . This is 
known as the truncation error. 

We therefore see that the total error is less than the sum of these, and so is 
less than 4.27 x 10“ 7 < 5 x 10“ 7 . 

Hence e lies in the range 2.718 281 5 + 5 x 10“ 7 . 

=> 2.718 281 < e < 2.718 282 

=> e = 2.718 28, correct to five places of decimals 

(in fact e — 2.718 281 8 to seven places). 


Exercise 9.4a 

1 Evaluate the following functions at intervals of 0.5 from x — — 1 to 
x — -j- 2: 

(i) l; ( it) 1 + (Hi) 1 + * + \x 2 \ 

(tv) 1 + * + %x 2 + %x 3 ■ (v) 1 + * + jx 2 + ±x 3 + £x 4 ; 

(vi) e x . 

Hence, on a single accurate figure, plot the graphs of these functions from 
x = — 1 to x = +2. 

2 Find the power series, as far as the x 4 term, of: 

(0 (**) * 2X ; (iii) i(e x + e~ x ); ( iv) ±(e x - e~ x ). 

3 Use the exponential series to calculate to five decimal places: 

{i) yje; ( ii ) (iii) e 0A . 

e 


Logarithmic series 

If In x = a 0 + a x x -f a 2 x 2 + a 3 x 3 + • • • 
it would immediately follow that 
a 0 = In 0. 

But In 0 does not exist, so we cannot express In x as a power series. 
We can, however, consider the function In (1 + x). 



/(0) = In 1 = 0 
/'( 0 ) = 1 
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Now f(x) = In (1 + x) 

1 


/'« 


1 + x 


/"(*) = - 


/'"(x) = + 


1 


(1 + x) 2 

2 

(1 + *) 3 


/"(*) = (- 1)" -1 ^ 


D! 


/'(0) = -1 

/'"( 0 ) = +2 

/"(0) = (-l)"- 1 ^ - 1)! 


(1 + x) n 

So, with the usual assumptions for Maclaurin’s series, 

2! 3! rc! 


In (1 + x) = 0+ lx + 


=> In (1 + x) = x — \x 2 -f ^x 3 — }x 4 H-H---* * ( 1 ) 

and it can be shown that in this case the domain of convergence is 

— 1 < x < 1 . 

Putting —x for x, we obtain 

In (1 — x) = —x — \x 2 — |x 3 — \x 4 — • • • ( 2 ) 

whose domain of convergence is —1 < — x ^ 1 , i.e., — 1 ^ x < 1 . 

If we now subtract these two results, we obtain 
In (1 + x) — In (1 — x) = 2 (x + |x 3 + 3 -x 5 H-) 

=> \ In ——— = x + jjX 3 + jx 5 + * * * (3) 

1 — x 

which is convergent provided both the original series are convergent, i.e., if 

— 1 < x < 1 . 

It is instructive to use these series to calculate In 2. 

If we put x = 1 in (1), we obtain 

In 2 = 1 — 2^3 — 4 

But the millionth term is — 1/10 6 and so is still affecting the sixth decimal 
place. The series, therefore, converges so slowly that it is not very useful 
for the calculation of In 2 . 

If, on the other hand, we use (3), we see that to calculate In 2 it will be 
necessary to let 

1 + x 
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Putting x = f, we obtain 

2 In 2 = f + f (f) 3 + ^ ("3) 5 + * • • 

=> In 2 = 2[i + i(i) 3 -P i(i) 3 H-] 

and it is clear that this series converges much more rapidly. How many 
terms of this series need to be taken in order to determine the sixth decimal 
place ? 


Exercise 9.4b 

1 Evaluate the following functions at intervals of 0.5 from x = — 1 to 
x — -f-1: 

(i) x; (it) x — fx 2 ; ( Hi) x — fx 2 + j^x 3 ; 

(iv) x — fx 2 + fx 3 — fx 4 ; (v) In (1 + x). 

Hence graph these functions on a single diagram from x = — 1 to + 1. 

2 Find the power series, as far as the x 4 term of: 


(*) In 1 + 


(ii) In (1 + 2x ); (Hi) In- 


1 -|- x 

3 Use the series for Jr In-to evaluate, to three decimal places: 

1 — * 

(*) In 3; (ii) Inf- 
Hence calculate In 5. 


Binomial series 

We now turn to the function 
fix) = (1 + x) n (neR). 

Now f(x) = (1 + *)" => /(0) = 1 

=> f'( x ) = n ( 1 + x)"' 1 => /'(0) = n 

=> f"{x) = n(n - 1)(1 + ^)"- 2 => /"(0) = n(n - 1). 

More generally, 

f (r) (x) = n(n - 1)... (n - r+ 1)(1 + *)" _r 
=> / ,r) (0) =n(n- 1). ..{n - r +1). 

So, using the Maclaurin series, we obtain the binomial series: 



and it can be shown that the series is convergent, provided |.v| < 1 . 
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If n e Z + , we notice that the series terminates and reduces to the binomial 
theorem. 

For instance, if n = 4, 

, 4x3, 4x3x2 o 4 x 3 x 2 x 1 4 

(1 + x) 4 = 1 + 4x + —£T X + -^- x + ---* 


3! 


4! 


+ 


4 x 3 x 2 x 1 x 0 

-a 

5! 


4 x 3 x 2 x 1 x 0 x -1 

_l——-- x + ■ 

6 ! 


= 1 + 4x + 6x 2 + 4x 3 + x 4 . 
More generally, if n G Z + , 
n{n — 1)(« - 2)...(« - r + 1) _ n 


r ! 


C r if r ^ n, 
and 0 if r > n 


so that the binomial series simply becomes the binomial theorem. 

If n $ Z + it is impossible to speak of "C r , as _3 C 2 , *C 3 etc. have no 
meaning. Nevertheless, we find it convenient to have an abbreviation for 
the coefficients of the binomial series, and write 

n(n — 1) (n — 2) (n — 3) .. . (n — r + 1) 




f-2\ (-2)(-2 - l)(-2 - 2) _ 

Ur si • 

whils ' C) ‘ 1 Ur il - 6 - " c - 

With this notation we can write the binomial series as 



Example i 
Find a series for (1 




1 + 


(- 2 ) 



+ 

+ 


( — 2) ( — 3) ^ xj 

( — 2) ( — 3) ( — 4) / x\ 3 
3! V V 


+ ••• 


= 1 + X + f * 2 + 2* 3 H- 




which is convergent if 
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1*1 < 2 . 


< 1 


Example 2 

Use the binomial series to estimate 4/999 to seven places of decimals. 

4/999 = (1 000 - 1) 1/3 


= 10(1 - 0 . 001 ) 1/3 . 

x 

Now (1 - 0.001 ) 1 ' 3 = 1 + |(-0.001) + 1 

+ 3 * -f x ~3 (_qoo 1 )3 + 


2 ! 


(- 0 . 001 ) 


3! 


4! 


- 0 . 001) 4 +• 


The first three terms are 

+ 1.000 000 000 0 

- 0.000 333 333 3 

- 0.000 000 111 1 

= + 0.999 666 555 6 


It is easy to see that the maximum rounding error is 10 10 . Also the 
remainder is 

5 10 

- 8 r (0 - 001)3 - 243 (0 - 001)4 “"-’ 

so the truncation error is 


^ (0 - 001)3 + ^ (0 - 001)4 + ••• 


< — ( 0 . 001) 3 [1 + ( 0 . 001 ) + ( 0 . 001) 2 + • • •] 

81 

5 


< — x 10- 9 x ^ < 10 - 10 . 
81 999 


Hence the total error < 2 x 10 10 

and 4/999 = 9.996 665 556 

with a maximum possible error of 2 x 10 “ 9 

So 4/999 lies between 9.996 665 554 
and 9.996 665 558. 


Hence 4^999 — 9.996 665 6 to seven decimal places. 
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Exercise 9.4c 


1 Find the first four terms of the power series of the following functions, 
in each case stating their domain of convergence: 

(*) V( ] + *); (») . j- ■■■; (»i) (i + *)~ 3 ; 

(iv) (1 - 2x)~ 1/2 ; (v) (2 + x)~ l ; ( vi ) ^/(4 - 2x). 

2 Evaluate, to five places of decimals : 

(}) y/l-01; (it) y 0.99; (m) V4.1; (iv) */ 27.27; (v) 

3 Obtain binomial series for 1/(1 — x) 2 and 1/(1 — x) 3 , and check by 
means of differentiating the series for 1/(1 — x). 

Hence sum the series: 

1 + 2 (|) + 3(i ) 2 + 4(ir) 3 +-... 

4 (i) Using yj(x 2 + 1) = x(l + l/x 2 ) 1/2 , show that if x is large: 

■yj(x 2 + 1 ) « x + --+ • • * (known as an asymptotic series). 

(ii) Find the first three terms of similar asymptotic series for \/(x — l ) 2 
and (x 3 + 2 ). 


Taylor series 


The Maclaurin series enable us to find the value of a function near x — 0 
in terms of its value and those of its derivatives at x = 0. If we now seek 
to investigate its value in the neighbourhood of another point x = < 2 , we 
can let the point be x = a + h and write 

/(« + h) = F(h) => f(a) = F( 0) 

so that f'(a + h) = F'(h ) => f'(a) = F'( 0) 

f"(a + h) = F"(h) => f"(a) = F"(0). 

h 2 

But F(h) = F( 0) + hF'( 0 ) + —F"( 0 ) + ■ ■ • 



which is known as the Taylor series (after Brook Taylor, 1685-1731), and 
which enables us to find the value of the function / (x) near x = a in terms 
of its value and those of its derivatives at x = a. 

In particular, if h is very small, 

f(a + h) ~f(a) + hf'(a) 
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provides a linear approximation to the function, already familiar in the 
form 


/'(«) 


_, /(« + h) -/(a) 

A 


Exercise 9«4d 

1 Find, as far as the x 4 term, Maclaurin series for 

(i) sinx; (ii ) cosx; (m) tan*; (it/) sec*; (v) e x sinx; (vi) Insect. 

2 Use Maclaurin series to calculate to five decimal places, tan 0.1, tan 1°, 
tan T. 

3 Find the first four terms of the Taylor series for tan (tt/4 4 - h) and 
hence calculate, to five decimal places, tan 46° and tan 45° 1'. 

4 Obtain the Maclaurin series for tan -1 x and check by integrating the 
binomial series for 1/(1 + x 2 ). Hence 

(i) obtain Gregory’s series: 

_ = i _. 

4 1 3 + 5 7 + 

(ii) use the result (see Exercise 5 . 13 a, no. 3 ) that 

— 1 1 1 

— — tan w -h tan i 

4 J 

to show that 

l = Ci - m 3 + i(i ) 5 -••■] + [i - i(i ) 3 + |(i ) 5 


9.5 Integration by parts: reduction formulae 

We now return to the problem of integration. As was observed in Chapter 4 , 
a result about differentiation can often be transformed into an equally 
useful one about integration, and we saw how the chain rule for derivatives 
led to the method of substitution for finding integrals. We now proceed to 
look again, in just this way, at the formula for the derivative of a product. 
If u and v are two functions of x , we know that 



So 
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This is the rule for integrating by parts and is useful whenever the product 

du . dv 

-—vis easier to integrate than u — 
dx dx 

If we now replace u and v by U and V, where 
dv 

U = u and V = —> 
dx 

the result can be written as 


UV dx = U Vdx 


V dx ) dx 


Example i 


x cos x dx = x sin x — 1 sin x dx 


= x sin x 4 - cos x + c. 


Example 2 


x 2 e x dx = [x 2 e x ~\ — 2 x e x dx 

0 Jo 


= e — 2 |[x — J 1 e x dx 

= e-2{e - (e - 1 )} 

= e - 2. 


Example 3 


In x dx = 1 x In x dx 


= x In x — x - dx 

J x 

— x In x — x + c. 


Example 4 


e x sin x dx = (— e x ) sin x 


- e x ) cos x dx 


— —e *sinx 4 - e x cos x dx 


f Conveniently remembered as 
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— e x sin * + ( — e ) cos x — 


-e x ) (— sin x) d# 


2 e x sin x d* = —e x (sin x + cos x) + c 


e x sin x dx = —\e x (sin x + cos *) + c' 


Exercise 9 . 5 a 

Find the following integrals: 

1 (i) f*£ x d*; (■ ii) fxsinxdA:; ( Hi) fx^ 2 ;)C dx; 


(iv) x In x dx ; (v) x cos — dx ; (vi) 


2 (i) x 2 e x dx; (ii) x z sin x dx 


iii) 6 sec 2 Odd; (iv) u\0 u du. 


3 (i) £ x sin*dx; (ii) £ x cosxd*; 


iii) e x cos*dx; (iv) e x sin 2x dx. 


4 (i) (ln^) 2 ; (ii) tan 1 xdx; 


(iii) J sin 1 u du; (iv) J x tan 1 x dx. 
5 Evaluate: 

P - .. f** 

(i) xe x dx; (ii) I A:sin 2 ^dx; 


(iii) t 2 co$^tdt; (iv) u 2 In u du. 


6 A mathematical model is constructed to describe the situation in a 
factory. The number of workers w which an employer will require is a 
function of the wage rate £r per week for each worker where w = 100 e~ 0,04r . 
If the union wishes to maximize the total pay of its members required by the 
employer, what rate will it ask for? (Assume that all workers belong to the 
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Reduction form ulae 

It is sometimes possible to find a formula which will reduce a given integral 
to a simpler one of the same type, and then to find the required integral by 
successive applications of this process. 


Example 5 

If l 


■-h 


~ x dx, evaluate h. 


Now I n = x n (-e~ x ) 


(nx n - l )(-e~ x ) dx 


= -x"e~ x + n j x”- 1 


=> I n = — x" e x + . 

Hence I 3 = — x 3 e~ x + 3 / 2 
I 2 = -x 2 e~ x + 2f 
J 1 = -x e~ x + I 0 . 

But Iq — —e x 4~ c. 

So I 3 =-x 3 e~ x + 3(-x 2 e~ x + 2f) 

= — (x 3 4 - 3x 2 ) e~ x 4- 6 ( — x e~ x 4- I 0 ) 
= -(x 3 + 3x 2 + 6 x) e~ x + 6 1 0 
= — (x 3 + 3x 2 4 - 6 x 4- 6 ) e~ x 4* c. 


Example 6 


I„ = 


sin” x dx 


n* fin 

I n — I sin" x dx = I sin" 1 x x sin x dx 


= [sin" 1 x ( — cos x)] Q 




r 


(« — 1 ) sin" 2 x x cos * ( — cos x) dx 


= 0 4 - 


sm x cos 


So 


(t. — 1 ) I sir 
= (« — 1 ) I sin" 2 x (1 — sin 2 x) dx. 

r f ■) 

/ B = (n - 1 ) | sin" 2 X dx - J sin" x dxj 
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/„ = («- 1)(/„_ 2 - q 

nl„ = (n - 1 )/„_ 2 


L = 


n — 1 


and this reduction formula enables us to calculate I n in terms of /„_ 2 . 
But we know that 


L = sin x d* = [ — cos x ] = 1 

Jo 0 


and I ( 


0 = 1 d * = M o ’' = frt. 

* 0 


So it follows, for instance, that 

r — It — 1 v 2; 1_ l 

^5 “ 5 i 3 — 5 X 3^1 — 5 X 3 X 1 “ 15 

and I 6 = f/ 4 = f x |/ 2 = | x f x ^/ 0 


— 5. y 3. y JL y -L<jt — 5tT 

— 6 A 4 A 2 A 2 /L — 32 




Similarly, if J n = J cos " * d*, we can put 


u = ~n - x 
d u = — dx. 


f° 

So J n — I cos"(|tt — u) x — dw 


= I sin” u du = I„. 


r 

Jo 


Summarising, if I n 


= sin” * d;t, 

Jo 


J' = 




cos" x d* 


then 



Exercise 9.5b 





1 Evaluate: 





(0 

% 

sin 4 0 d0; 

0 

(«) 

•/ 

li 

O 

cos 3 6 d6; 

fi* 

(m) J cos 6 0d0; 

{iv) 

f*£jt 

| sin 7 6 d0; 

'0 

M 

•i* 

0 

cos 8 0 d0. 
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2 Evaluate: 


f* 

J sin 5 0 d 0 ; (ii) J 


cos 3 0 d 0 ; {Hi) cos 3 0 d 0 ; 


#) f' 


(iv) 


sin 6 0 d 0 ; (v) 


sin 7 0 d6. 


= j (In *)• d*, 


~2 K 


(i) | sin 5 0 d 0 ; 
o 

3 ^ /, 

Hence find I 3 . 

4 Use tan" 0 = tan " -2 0 (sec 2 0 — 1) to find a reduction formula for 


show that I n = x(ln x)" — nl n _ x . 


I„ = 


tan" 6 d 6, and hence find I 4 and I 6 . 


5 ^ In- 


sec" 0 d 0 , prove that 


(n + 1 ) /„+ 2 = tan 0 sec" 0 + w/„ - 


9.6 Interlude: partial fractions 

At an early age we learnt such simplifications as 

2 1 _ !0 - 3 _ 7 

3 ~~ 5 “ 15 “ L5 5 


and, rather later, when we learned that 

1 1 (x + 1 ) — (x 1 ) _ 2 

x — 1 x + 1 (x — 1 ) (x + 1 ) a: 2 — 1 

we had little doubt that this too represented a simplification, as 2 /(x 2 - 1 ) 
is generally more useful and more convenient than \/{x — 1 ) — l/(x + 1 )* 
In the next section we shall frequently be needing to find such integrals as 



for which there is no immediately obvious answer. But a little meditation 
might lead us to reverse the above process and say that 



2 

x + 1 




+ 1 


dx 


= -ft In (x — 1 ) — ■£ In (x + 1 ) + c 


— \ In 


x — 1 
x + 1 


+ c. 



g .6 interlude: partial fractions 35 
Obviously we were extremely fortunate to know that 

1 1 JL 

1 _ _2_ 2 

x 2 — 1 x — l X + \ 

and our next task will be to find a process whereby similar rational func¬ 
tions (i.e. the quotients of two polynomials) can be expressed in terms of 
such partial fractions. 

The full theory of partial fractions is beyond our present scope, but we 
can illustrate the procedure by a number of examples: 

Example 1 

Express in partial fractions 
x + 1 

(x — 1 ) (x — 2 ) 

Hopefully, we write 

x + \ _ A ( B 

(* - 1) (x - 2) = * - 1 + X - 2 

where A, B are constants which are to be determined 

=> x + \ = A(x — 2) + B(x — 1). 

Putting x — 2, 3 = Ax 0 + Bx 1 => B — 3. 

Putting x = 1, 2 = ^4x— 1 +^x 0 => A = —2. 

So the required partial fractions would be 

-2 3 

--> 

x — 1 x — 2 

and it is quickly verified that these can be combined to give the original 
expression. 

Example 2 

We now seek to put into partial fractions the more complicated expression 

2x 3 — 3x 2 — x + 5 
2x 2 + x — 1 

First of all we observe that 2x 3 — 3x 2 — x T 5 can be divided by 
2* 2 + x — 1 to give a quotient x — 2 and remainder 2 ^ + 3 . 

So 2x 3 - 3x 2 - x + 5 = (2x 2 + * - 1) (* - 2) + (2* + 3) 

_ 2x 3 — 3x 2 — x + 5 _ 2x + 3 

2x 2 + x — \ X 2x 2 + x — l 
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Now 


2x 3 2x 4 - 3 

2x 2 + x — 1 (2x — 1 ) (x 4 - 1 ) 


so we write, 


2x + 3 = A B 

(2x — 1 ) (x 4 - 1 ) 2x — 1 + x 4 - 1 

=> 2x 4 - 3 = A(x 4 - 1) + B(2x - 1 ). 

Putting x = — 1, 1 = Ax 0 — 3B => B = — j. 
Putting x = 7T, 4 = v4x | => A — f. 

So (as can easily be checked), 


2x 4- 3 = _J_ _ 

2x 2 + x — \ 2x — 1 x + 1 


and 


2 * 3 — 3 * 2 — a: + 5 
2 x 2 4 - x — 1 


* - 2 + 


8 

3 


2 * ^ 1 


3 

* 4- 1 


In this case the key to our success lay in first dividing the numerator by 
the denominator, and wherever possible (i.e., unless the degree of the 
numerator is already less than that of the denominator) this must always 
be the initial step. 


Example 3 

We now take a function whose denominator contains a repeated factor: 
* 2 4- 5x + 9 

J[X) “ (x + 3)(* + 2) 2 ' 

Again, we first try the simple partial fractions: 


x 2 4 - 5 * 4 - 9 _ A B 

(* + 3 ) (* + 2) 2 = * + 3 + (* 4 - 2) 2 

=> * 2 + 5 at + 9 = 4 (* + 2) 2 + 5 (* + 3 ). 


Equating coefficients of a: 2 , *, and the constant terms, we see that it 
would be necessary to have 


1 = A 

5 = 4 A + B ► 
9 = 4A 4 - 3B 


which are clearly inconsistent. 


So/(*) cannot be expressed in terms of partial fractions (1). 

For our next attempt, therefore, above the quadratic denominator 
(x + 2 ) 2 we place a term Bx 4 - C, so that 

x 2 4 - bx 4 - 9 _ A Bx + C 

{x + 3) (* + 2 ) 2 = * + 3 + (x + 2 ) 2 
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=> x 2 -b 5x + 9 = A(x + 2 ) 2 + (Bx + C) (x + 3). 

Putting x = — 3: 3 = ^4 =>^4 = 3. 

Equating coefficient x 2 : 1 = ^4 + i? => B = —2. 

Equating constants: 9 = 4^4 + 3C => C = — 1. 

Hence (as can easily be checked) the required partial fractions are 

x 2 + 5x + 9 _ 3 2* + 1 

(*+ 3)~(x + 2 ) 2 = x + 3 “ (x + 2) 2 * 

Sometimes it is convenient to split this last fraction still further, by 
saying that 

2x + 1 _ 2{x + 2) - 3 _ 2 3 

{x + 2) 2 " (x + 2) 2 “ jc + 2 (* + 2) 2 

* 2 + 5* + 9 _ 3 2 3 

S ° 1 at (* + 3) {x + 2 ) 2 ~ x + 3 "H2 + (H 2) 2 * 

Example 4 

Finally, we take a rational function whose denominator contains a quad¬ 
ratic term which cannot be factorised: 

n \ = * — 3 

7W (* - i )(* 2 + i )' 

As it is impossible to express x 2 + 1 in real factors, we follow the prece- 
dent of the last example and write 

x — 3 _ A Bx + C 

(x — 1 ) (x 2 + 1 ) * — 1 x 2 + 1 

=> * - 3 = A(x 2 + 1 ) + (Bx + C) (x - 1 ) 

x = 1: => — 2 = 2^4 => A = -1 

x 2 term: => 0 — A -\- B => i? = +1 

constants: => —3 = A — C => C = +2. 

So it appears that 

x — 3 _ — 1 x + 2 

(* — 1) (x 2 + 1) x — 1 * 2 + 1 

which can quickly be verified. 

Partial fractions are principally used in integration, but it is also interest¬ 
ing to see how they can help us to find the sums of certain series. 
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If, for instance, we consider the series 

1 1 1 

+ 7- 7 + 7- -+••*’ 


1x3 3x5 5x7 

we see that its rth term is 


1 


(2r - 1) (2r + 1) 

This can easily be expressed in partial fractions as 


2r - 1 2r + 1 

1/1 1 


2 \2r — 1 2r + 1 


Hence the sum of the first n terms is 

o 1 1 1 

S„ — -1-1-1-b 

" 1x3 3x5 5x7 


1 




(2b - 1) (2b 4- 1) 

1 1 


-b 1 


1 


n 


2 n + lj (2 n + 1) 


+ 


2n — 1 2 n + 


Also, as rc—> 00 , S t 


> 2 


So the sum of the infinite series is 


Exercise 9.6a 

1 Express in partial fractions 
x + 1 


(0 


(«0 


(mi ) 


(x - 1) (x - 2)’ 
x 2 

(x - 1)(* - 2) (x - 3) 


(») 

; (v) 

(■ via) 


x ■ t-\ 1 

7> ( m ) 


x 2 — 4 

x — 1 

x 2 X 

„,3 


; (vi) 


x(2x + 1)’ 
x 2 + 1 
x 2 - 2x’ 


1 


2 Express in partial fractions 


(0 


(iii) 


1 


x(x — l) 2 


(x 2 — 1) (* + 1) 


(»') 


; {iv) 


X — 1 

x 2 (# T 1) ? 
x 3 

(* - m * 2 - i)' 



3 

(0 
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Express in partial fractions 


1 


x(x 2 + 1) 5 


(ii) 


- 1 


(iii ) 


- 1 


(*») 


* 3 + f 


39 


4 Find the first four terms of the power series for 

by first expressing it 

(i) in partial fractions; 

(ii) as [1 + (3x + 2a: 2 )] - 1 ; 

(Hi) as (1 + 2 x) _1 . 


1 

(1 +*)(!+ 2 x) 


5 Find the rth term, the sum of the first n terms and the sum to infinity of 
the series 


(0 

(») 

(iii) 


1 1 1 

1 x 2 + 2x3 + 3 x 4 + " 

1 1 1 

- -|_ - _j_ - _|_ . , 

1x3 2x4 3x5 

1 1 


+ 


+ 


1 


1x2x3 2x3x4 3x4x5 


+ 


Use of partial fractions in integration 

In the next section we shall return to the general problem of integration, 
but we can already see the use of partial fractions when we need to integrate 
the quotient of two polynomials (or rational function , as it is usually called). 


Example 5 

It is quickly seen that 

* + 1 f 

x 2 - 4~ 

So / 


-JG4 


X + 2 

x 

- 4- — 

2 x + 2 


dx 


= | In (x - 2) 4- i In (x 4- 2) 4- 


Example 6 

J (* - 1) {x 4- l) 2 d *’ 

We can find the full partial fractions in such a case, with a repeated factor, 
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by letting 

x 2 A B C _ 

{x - 1)0 + l) 2 = * - 1 + X + 1 + (x + l) 2 

=> X 2 s A(x + l) 2 + B(x - 1)(* + 1) + C(x - 1). 

Putting x = 1, l = 4A => A — 

Putting x = —1, 1 = — 2 C => C — —i[. 

Equating coefficient x 2 , 1 = A + B => B — f. 


So / = 


+ 


* — 1 x + 1 (x + l ) 2 
= \ In (x - 1 ) + f In (x + 1 ) 4 


dx 


1 


2 (x -f r 


+ c. 


Example 7 


r x 2 d* 

= J (x- l)(x 2 + 1 ) * 


We can express this in partial fractions as 


Bx + C 


(x - l)(x 2 + 1) X - l x 2 + 1 
So x 2 = A(x 2 + 1) + (Bx + C) (x - 1). 

Putting x = 1, 1 = 2A 

Equating coefficient x 2 , 1 = A + B 

Equating constant term, 0 = A — C 

2 1 \x + b 


A = i 
B = b 
C = i. 


So 


__ 2 


(x — 1 ) (x 2 + 1 ) 


+ *1 _ 

x — 1 X 2 4 1 


Hence / 


_ x C dx 
2 x — 1 

ft 


+ b 


x 2 4 1 


dx 4 b 


dx 


x 2 4 1 


= £ln (x - 1) 4 iln (x 2 4 1) 4 j tan 1 x 4 c. 

In this last example we encountered a quadratic term x 2 4 1 which 
cannot be split into real linear factors (and which is therefore said to be 
irreducible). As such terms, and their corresponding partial fractions, occur 
quite frequently, our next example will show how they can best be treated: 


Example 8 


(0 


r r 

A 


2x - 6 


6 x 4 25 


dx; (ii) 


f_I 

J x 2 — 6x 


4- 25 


dx; 
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<iS) { 7-ri 


6 x + 25 


dx. 


In (i) we immediately see that the numerator is the derivative of the 
denominator. 


So 


2 x - 6 


x — 6 x + 25 


dx = In ( x 2 — 6x + 25) + c. 


In (ii) we first write 

f_I_ d *= f_ 

J x 2 — 6 * + 25 J (x — 


1 


3) + 16 


dx. 


Putting x — 3 = 4 tan 9 => dx = 4 sec 2 9 d 9 we obtain 

4 sec 2 9 d9 


J X 2 - 6x + 25 dx { 


16 tan 2 0+16 
d 0 = $9 + c. 


But x — 3 = 4 tan 9 
x - 3 


tan 9 — 


9 = tan" 


x — 3 


So 


J x 2 — 6 ; 


6 x + 25 


, , x — 3 

d* = 4 tan -- + c. 


In (tit) we express the required integral as a linear combination of the two 
previous parts: 


f _*_dx= [J&L 

J x 2 — 6x + 25 J x 2 — 6 


- 6 ) 


6x + 25 


dx + 


x 2 — 6 x + 25 


dx 


x — 3 

= \ In (x 2 — 6 x + 25) + f tan 1 - 1 - c. 


Example 9 
1= |* 3 ~~ i - dx. 

J x 3 + 1 

As x 3 + 1 = (x + 1) (x 2 — x + 1), 
we can express x/(x 3 + 1 ) in partial fractions: 

x _ x _+ Bx + C 

x 3 + l (x + l)(x 2 — x + 1 ) x + 1 + x 2 - x + 1 

So x = A(x 2 - x + 1 ) + (Bx + C)(x + 1). 
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Putting x = - 1 , -1 = A{\ + 1 + 1 ) 

Equating coefficient x 2 , 0 = A + B 

Equating constants, 0 = A + C 


A = -f 
B= +i 
C = + i. 


So 


+ 


3 X "P 3 


x 3 + 1 a :+1 ' a : 2 — a :+1 

/ 


Now 

and 


= — i f —~r dx + | f ~ —7 d*- 

J * + 1 Jx-x+l 

I —-dx = In (x + 1 ) 

J X + 1 

l (2x - 1 ) , f 


X + 1 


x 2 — X + 1 


x 2 — x + 1 x 2 — x + 1 
= ^ In (x 2 — x + 1 ) + | 


dx 


dx 


x 2 — x + 1 


But 


dx 


x — x + 1 


dx 


(X - I) 2 + t 


- v /3 

and we now let x — j = tan 0 


dx = sec 2 0 d 0 . 

2 


Hence J 7- d I + i -ji 


r^/3 sec 2 0 d 0 


2 

V3 


But x 

So 

and 


tan 2 0 + f 
sec 2 6 d 6 ! 

tan 2 0+1 3 J 

2 x - 1 Q _ 2 x — 1 

tan 0 => 0 = tan - 


de -73 e ' 


V3 


— ^ tan 6 

2 2 

r dA - _ 

J x 2 — X + 1 

3 In (x + 1 ) + In (x 2 - x + 1 ) 

X + 1 


V3 


2 2 x - 1 


x/3 


n/3 


+ f x ^ tan 


_j 2 x + 


1 


+ c 


, x 2 — x + 1 1 , 2 x — 1 

= iln 7 . T7T- + — ta " _1 --- 


(x + l ) 2 yj 3 


V3 


+ c. 


Exercise 9.6b 

Find the following integrals: 


dx 


r 


( " ) j 5 


dx 


+ x 


(Hi) 


dx 


2 x — x 2 


1 (i) 



(*») 

2 (i) 

(m 


x 2 - 4 


dx; 


(t>) 


I 
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dx 


f dx 

J (*+ i) 2; 
.... r * + 1 
in) ) 


4x 2 - r 

r dx 


(w) 


dx 


2x 2 + x — 1 


dx; 


(*») 


3 (0 
(») 
{Hi) 

4 (0 
(*») 


2x dx 
x 2 + 9 
8x dx 
4x 2 + 9 
2x - 6 




6x + 10 


x dx 


r 


f x dx f x 2 dx 

”»J <•> J 


dx 

(«) 

(") 


J 

| x 2 (x 

+ 2)’ 

\ 

x 2 

dx 

% 

(x - 

i) 2 


dx 


x 2 

+ 9 



dx 


4x 

2 + 9 



dx 



x + 3 
x 2 + 9 
2x - 3 


dx; 


4x 2 + 9 
dx 


dx; 


6x 4- 10 




6x + 10 


x 2 + 1 


dx; 


(w) 


x 3 + 1 

x dx 
x 3 — 1 


dx; 


9.7 General integration 


In the last section we learned to integrate rational functions by means of 
partial fractions. We now seek to draw together other methods we have 
employed for finding integrals. 

Faced with a particular function whose integral is required, our primary 
need is for a very thorough knowledge of differentiation so that we can try 
to find another function, or ‘primitive 5 , whose derivative is the given 
function. This may not always be possible: for instance, until we invented 
the function In x it was impossible to integrate even such a simple function 
as 1/x, and we shall frequently encounter similar examples. Nevertheless, 
our first step must be to re-state the standard derivatives and the integrals to 
which they give rise. 


t-K) 

dx 


dx 


(In x) 


( o 

dx 

d 

— (sin x) 
dx 


fix 11 1 

1 

x 


COS X 


J 

/* 

* 

1 

f 


x dx 

— + 
n + 1 

dx 

— 

= In x + c 

X 

e x dx 

= e x + c 

cos x dx 

= sin x + c 


(« ^ - 1 ) 
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— (cos x ) = — sin x 

dx 

_d 

dx 

d , i 

d * > V( ! - *) 

d , , i 

dx y ’ l + x 2 


J 

I 

I 

I 


sin x dx 

sec 2 x dx 

dx 

To - * 2 ) 

dx 

1 + x 2 


— cos X + c 

tan x + c 
sin -1 x + c 
tan - 1 x + c 


These standard integrals frequently need to be used in conjunction with 
general methods such as that of substitution or integration by parts and, 
although these have been investigated in previous sections, considerable 
practice is necessary in order to sense what is the most appropriate method 
for each particular example. We have already stressed the crucial impor¬ 
tance of integration in finding areas and volumes, and it will soon be seen 
that it is similarly central in statistics, mechanics, and throughout all 
branches of applied mathematics. 


Example i 



Here a substitution converts the integral into another which is more 
tractable, 

for x = a sin 9 => dx = a cos 6 dd. 


So 1 = 


(a 2 — a 2 sin 2 6) a cos 6 dd 
= a 2 J cos 2 6 d 6 

(1 + cos 29) d9 


-*d 


= \a 2 (9 + j sin 29) + c 
= ^ a 2 (9 -f sin 9 cos 9) T c. 

Finally, however, it is necessary to revert to the original variable x. 
So f — * 2 ) dx = \a 2 sin -1 x + \xy](a 2 — x 2 ) + c. 


Example 2 
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Because tan -1 x is a function which we can easily differentiate, there is 
an immediate attraction in using the method of integration by parts: 

/ = f 1 tan “ 1 x dx 


x - - 5 - dx. 

1 + x 2 


_! f 2, 

2 J 1 + X 2 


So I = x tan 1 x — A 


= x tan 1 x — A In (1 + x 2 ) + c. 


Example 3 


/ = cosec x dx. 


It may be recalled that sin x, cos x, and tan x are easily expressible in 
terms of tan \x : 

t — tan ix 


-75 cos x = -- 

t 2 1 + t 2 


tan x — 


So the substitution 
t = tan ttx 

is sometimes particularly convenient for simplifying trigonometric integ¬ 
rals. 


Now t = tan - 

2 


d t = \ sec 2 \x dx = A(1 + t 2 ) dx 


So / = 


2di/(i + f) _ r a , 

22/(1 + <=) + ' 


= In tan Ax + c. 


Hence cosec x dx = In tan Ax 4 - c 


and it is quickly verified that 
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d 

dx 


d 1 

— (In tan \x) = -— (sec 2 jx) x ^ 


1 


1 


2 sin ix cos bx sin x 


= cosec x. 


To complete the catalogue of the integrals of elementary trigonometric 
functions, we recall that 


tan 


, f sin x 

xdx = I - 

J COS X 


dx 


= — log COS X + c 
= log sec x + c 


. f . f sec x (sec x + tan x) 

and sec x dx = - 1 dx 

J J sec x + tan x 


sec x tan x T sec 2 x 


-f 

J sec x + tan x 
— In (sec x + tan x) + c. 


d* 


Finally, we give two examples which are quickly dealt with by a pre¬ 
liminary rearrangement: 


Example 4 

/ 


= tan 4 x 


d*. 


Now I = | (tan 2 x) (tan 2 x) dx 

= I tan 2 x (sec 2 x — 1) dx 


I 


= J (tan 2 x sec 2 x — sec 2 x + 1) dx 
= tan 3 x — tan x + x + c. 

Example 5 


f sin x 
J sin x + c< 


- dx. 


+ cos x 

We first notice that we could easily find 
sin x T cos. 


and 


sin x + cos 

cos x — sin x 
sin x + cos x 


x f 

- dx = dx = x 

x J 


dx = In (sin x + cos x). 


So 


J- 
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sm x 


sin x + cos x 


dx = ^ 


sin x + cos x 
sin x + cos * 


dx — A 


cos x — sin x 


sm x + cos x 


- dx 


= 2 * “ 2 In (sin x + cos x) + c. 


Exercise 9.7 

Find the following integrals: 


1 (i) 

(*») 

(vii) 

(x) 


5x 4 dx; (ii) | < 6 d?; 

dx 


lyjx* 

da 

(2a + 1) 


(v) 

3 ’ ( viii ) 


») J t 6 dt; (in) j—^r’> 

j" yjvdv; ( vi ) j" (x + l) 4 dx; 

j 7(3»-2) ; <“> 


f x 2 dx 

x) J (i - 


2 (o 

(iv) 

(vii) 

w 


dx; 


f vdv 

J v 2 - V 


e 2x dx; 


; (xi) 

(ii) 

(v) 

(viii) 


x N /(x 2 + 1) dx; (xii) J -q- ~ ^ 


**) 


(Hi) 

(vi) 


da 

1 - a ’ 


|1 + i 

3 (?) Jcos2xdx; 


-da; (xi) 


f x 2 dx f 2x + 1 

Jr^ ; 

2e 3x dx; (ix) Jxe - * 2 dx; 

flnx 

(xw) -dx. 

J * 

J” sin ^x dx; 


10* d/j 


(ii) 


(???) 


sin 2 9 cos 0 d0; 


(*») 

(w«) 

(x) 

4 (0 


cos 2 0sin0d0; (t/) 


I 


sin x cos x dx; (viii) J sin 2 jx dx; 
cos x 


/• 

i) sir 

* 


cos 3 0d0; (vi) sin 3 0d0; 


(ix) 


cos 2 x dx; 


dx; 


V (sin x) 
j* sec 2 x dx; 

(???) J sec 6 tan 9 d9; 


, , f cos 9 tn r 

(xt) j^9 d0; <**>J 


sin 9 
cos 6 


d 9. 


(a) 


tan 2 x dx; 


(») 


cosec 2 9 d 9 ; 


(iv) J cosec 9 cot 9 d 9 ; 
(w) J* cot 2 9 d9; 
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(vii) J 


/% 

%> 


(xi) 

5 (*) 

(ii) 

{Hi) 

(iv) 


tan 0 sec 2 0 d0; 

{viii) j sec 3 0 tan 0 d0 

tan 2x dx; 

(x) J tan 3 x dx; 

cot 2 % dx; 

'”) I co,I ‘ d '- 

dx 

(x = 2 sin 0); 

V ( 4 - * 2 ) 

f dx 

(2x = 3 sin 0); 

) V ( 9 - 4 * 2 ) 


dx 


1 + 4x 2 
dx 

4 + 9x 2 
d u 


4/ 

I 

I n /(“ 2 + !) 

r d « 

J V(« 2 -1) 

1 — x 2 ) dx (x = sin 9 ); 


{v) 

{vi) 

{vii) 

{viii) I ■*V ( 1 


(2x = tan 9); 
(3* = 2 tan 0); 
{u = tan 0); 

{u = sec 0); 


x 2 ) dx; 


(*x) 

6 (z) 


c 


dx 


f x dx 

{x) J 4? + 9’ 


4x 2 + 9’ 

xcosxdx; (ii) X dx; (in) 

* 

(iz;) Jx 2 sinxdx; (^) J sin 1 x dx; {vi) 

{vii) J 0 sec 2 0 d0; {viii) j* x 2 e 2x dx; (ix) 

(x) f e 2t cos 3t dt; (xi) 


V 

1 

J x 2 In x dx; 


; (xii) 


£ sin 3 1 d t; 
In x dx; 

sin t dt ; 
x 10* dx. 


J- 


7 Use the substitution t = tan ^x to find: 


»l"r 


dx 


+ sin x 


t—; (») 




dx 


1 0 1 4- sin x + cos x 


nn 

’ ( " ) J. i 


dx 


sin x + 


cos x 



8 Show that 
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f(x) dx = f(a — x) dx 


and hence (using no. 7 ) evaluate: 

C* x dx .. f** x dx ..... C* n x dx 

(0 —:—; («) —:— -; ( m ) --—- 

Jo 1 + sin x Jo 1 + sin x + cos x Jo sin * + cos x 


9 Find the following integrals: 


x 2 + x 9 


x 3 + x’ 
x dx 
x 4 + 1 ’ 


(*) ttt -; (**) 

2 + COS X 


Ky/{X 2 - 1 )’ 


: dx; (vi) 


( viii ) (lnx) 2 dx; (ix) x 2 tan xdx; 


10 Evaluate: 


0 (* + l)(x 2 + 1)’ [U) Jo (* 2 + 1) 2; 


1 x dx 


M dx 

o V+ l) 3/2; 


0 (* + 1) 


v) sin 5 0 cos 2 0 d 0 ; 


(vi) sin 2 0 cos 6 0 d 0 ; ( vii) x 2 In xdx; 


V ( 4 - x 2 ) dx; 

tn cos 0 — sin 0 ^ 
cos 0 + sin 0 ’ 


-tjr 3 sin 2 0 4 - cos 2 9 


d 9 

o 2 + cos 0 5 


(put t = tan 9 ); 


x - 2 
4 — x 


dx (put x = 2 sin 2 0 + 4 cos 2 0). 


9.8 Differential equations 

We have seen that the process of integration is the reverse of differentiation 
and so requires the discovery of a function having the given rate of change. 
Frequently, however, we do not know either the function or its rate of 
change, but merely how they are related. 
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When, for instance in 9.2, we were investigating the cooling of bathwater, 

we simply knew that after time t the excess temperature 9 and its rate of 

d 9 , , , 

decrease-were connected by the equation 


d 9 
d t 


= —k9 


( 1 ) 


and in the same section it was mentioned that the current x flowing in a 
particular electrical circuit obeyed the equation 


d 2 x dx 

—~2 + 2 —— 
dt 2 d£ 


+ 5 * = 0 . 


( 2 ) 


Similarly the rate of growth of the mass m of an organism frequently 
depends on the equation 



( 3 ) 


and the rate of spread of a disease in a population, of which a fraction x is 
infected, is often governed by the equation 


— = kx( 1 - x). 
d t 


(4) 


Any such equation which connects the rates of change of variables with 
their values is called a differential equation. Equations (1), (3), and (4) are 
differential equations of the first order and equation (2) is of the second order , 
so called from the degree of their highest derivatives. 

In particular, such a first-order equation as 
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This is called its general solution; and by giving c a particular value we can 
obtain particular solutions, such as 

y = a* 3 + 2 or y = jx 3 - 4. 

Graphically, we see that the general solution is represented by a family of 
curves , and that each particular solution is an individual member of this family. 

Two further examples will illustrate this relationship. 


Example i 
If x 2 + y 2 = a 2 , 

form a differential equation independent of a. 

As a varies, the equation 

x 2 + y 2 — a 2 

represents a family of circles, all with centre at the origin. 



But x 2 + y 2 = a 2 


d u 

2x + 2 y —— = 0 
dx 


dy 

x + y — = 0. 
dx 
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This is the required differential equation, which can be written as 

- _i. 

x dx 

and since the gradient of OP is y/x , this equation is simply expressing the 
fact that the tangent to each circle at every point is always perpendicular 
to the radius. 

Example 2 

A point P is moving along a straight line and after time t its displacement x is 
given by 

x — A sin 2t + B cos 21. 

Form a differential equation which is independent of A and B. 

As we here need to eliminate two arbitrary constants, we shall have to 
differentiate twice and form a second order differential equation. 

= A sin 2t + B cos 2 1 
= 2A cos 2 1 — 2 B sin 2 1 

= — 4 A sin 2 1 — 4 B cos 2t. 

= —4x. 

This, therefore, is the required equation, showing that for all values of 
A and B the acceleration is always — 4x, and we see that 

x = A sin 2t + B cos 2 1 

is a general solution of this equation. 

More frequently, however, we start with the differential equation and 
need to find appropriate solutions. In this case, for instance, the equation 



might have arisen from investigation of a buoy bobbing up and down in 
the sea, and our task would generally be to show that this leads to a solution 
of the form 

x = A sin 2t + B cos 2 1, 
representing a periodic oscillation. 

Exercise 9.8a 

1 Verify that the following differential equations have the solutions indi¬ 
cated (where A, B , e are arbitrary constants): 


Now x 


dx 


~dt 


H 

CM 


d t 2 

So 

d 2 x 
d t 2 




9-8 DIFFERENTIAL EQUATIONS 53 


(i) — = 3x; x = A e 3t . 
d t 

(\y 

(ii) x— =2 y\ y = Ax 2 . 


(: Hi) 



x; 


x 2 — y 2 = A. 


(*») 

(») 

(«*) 


^ = *(1 - x); x = 1 _ t - 

d£ 1 + .<4 £ 

d 2 x dx ^ _ _ 9t 

— T 4- 3 — + 2x = 0; x = A e 1 + B e 

d t 2 d t 


d 2 x 


dx 


—2 + 2 — + 5x = 1; x = ^4 e 1 sin (2/ + e) + 


1 

5* 


2 Find differential equations which are satisfied at all points of all 
members of the following families of curves (where A , B , r, etc. define the 
individual members of the families): 

(i) xy = c; (ii) y = A e x ; (Hi) y 2 = ax; 

(iv) x 2 — y 2 — a 2 ; (v) y = Ax 2 + Bx; (vi) x — A e l + B e~ l . 

We have now begun to see the relationship between a differential 
equation and its solutions. As with algebraic equations, the task of the 
mathematician is usually twofold: 

(i) to express a given situation — whether mechanical, electrical, chemi¬ 
cal, biological, economic, or whatever—in a suitable mathematical form 
(usually called a model), which very frequently leads to a differential 
equation; and, 

(ii) to find the appropriate solution of such a differential equation. Very 
often this has to be done numerically, but we shall begin to discuss the 
problem by taking three special types of differential equation for which it is 
usually possible to find a general solution. 


First order equations with separable variables 
A typical first order equation is 


dy_ 

dx 


= /(*> y). 


and if/(x, y) is defined at every point of the x—y plane, it follows that at 
each point a corresponding gradient and direction are known, just as in a 
magnetic field plotted by a compass needle. 
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Example 3 

d y x 
dx y 

d y 

The values of the gradient — can be indicated by a table: 

d* 



-3 

-2 

-1 

X 

0 

1 

2 

3 


3 

-1 

2 

3 

1 

3 

0 

1 

3 

2 

3 

1 


2 

3 

2 

-1 

1 

2 

0 

2 

1 

3 

2 


1 

-3 

-2 

-1 

0 

1 

2 

3 

y 

0 









-1 

3 

2 

1 

0 

-1 

— 2 

-3 


-2 

3 

2 

1 

2 

0 

~ 2 

-1 

3 

2 


-3 

1 

2 

3 

i 

0 

1 

3 

2 

3 

-1 


These can then be plotted at the different points of the plane and already 
we can begin to see the family of solutions, looking like the lines of force of a 
magnetic field. 


\ 

\ 


^ - 

y 



/ 

\ 

\ 

\ 

\ - 

- - 


/ 

/ 

\ 

\ 

\ 

\ - 

- / 

/ 

/ 

/ 

\ 

- 1 

\ 

1 

\ 

1 

\ - 

- / 

1 _ 

/ 

1 

/ 

1 

/ 

1 

/ 

1 

/ 

1 

/ 

1 

/ - 

1 

- \ 

1 

\ 

1 

\ 

\ 

/ 

/ 

/ 

/ - 

- \ 

\ 

\ 

\ 

/ 

/ 

/ 

- - 

- \ 

\ 

\ 

\ 

/ 

/ 



- — 


\ 

\ 


But we can also approach the problem quite differently. For the equation 

dy x dy 

— — - can be written as y — = x 
dx y dx 
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=> J y dy — J x dx 

=> \y 2 — \x 2 + c 

=> y 2 = x 2 + 2c. 

This, therefore, is the general solution of the given equation; and we can 
obtain different particular solutions by giving different values to the constant c. 

Graphically, we see that each particular solution is a certain curve (in 
fact a rectangular hyperbola, each one having two branches), and the 
general solution represents a family of such hyperbolas. Finally, the sketch 
of these hyperbolas should be compared with our original ‘compass- 
needle’ figure. 


y 

c = 2 




c = — 2 => y 2 = x 2 — 4, etc. 


more rapidly by 

( 1 ) 

(la) 

( 2 ) 

h 2 = ix 2 + r 

y 2 = x 2 + 2c. 


In the above solution we could have proceeded a little 
saying that 


dy 

dx 


can be written 
and then as 


y dy = x dx 

x dx 


ydy = j*. 
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Although the statement (la) has not (as yet) any proper meaning, we 
have already justified the argument from (1) to (2), and (la) clearly serves 
as a very useful intermediary. This is usually known as separating the variables : 
any first order equation which can similarly be expressed as 

f ( y ) dy = g0 ) dx 

is said to have separable variables , and its general solution can be written as 
[ f { y ) dy = f g(x) dx. 


Example 4 

Solve the differential equation 
dy = _y_ 

dx x 

Here again, the variables are separable and we can write 
d y dx 

y x 

=> \n y = —\n x + c 
=> In xy = c 

=> xy — a (where a = e c ). 

The solution-curves are therefore: 



Since x/y X y/x = — 1, it is clear that the directions at any point of the 
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solution-curves of the two equations must be perpendicular, and this is 
very clearly seen when the figures are superimposed: 



Such pairs of families of curves which are perpendicular everywhere they 
meet are called orthogonal families and are outstandingly important not 
only as lines of forces and their corresponding equipotentials , but throughout 
applied mathematics. There is indeed a particularly simple illustration of 
them on a map as the contours and the lines of greatest slope of a hillside, and 
this last diagram can be regarded as the map of a saddle-point with rainfall 
streaming from the hillsides in the North-East and South-West into the 
valleys in the North-West and South-East: 
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Example 5 

A stone falls vertically from rest against an air resistance which is propor¬ 
tional to its velocity v 9 so that its acceleration is g - kv, where g and k are 
constants. 

(z) Write down differential equations connecting v with t, the time; and 
v with x, the distance fallen. 

(zz) Find expressions for v in terms of t and x in terms of t. 

{Hi) Find the terminal velocity, as t —> 00 . 

(z) The downwards acceleration 

dv dx dz; dv 

d t d£ dx dx 


So 


dz; 

d£ 


= g - kv 


and v — = g — kv. 
dx 

(zz) From (1), we see that 


dv 


g- kv 


= d/ 


-In {g — kv) = t + c. 

k 


But when t = 0 , z> = 0; so c — —— In g. 


Hence 

In I 1 


1 1 1 g — kv 

. t = -ln(g-kv) --]ng = -ln—j- 


kv 


= —kt 


g (1 

'■* (i 


kv 

1- = e 

g 


" kt\ 


dx g kt 

Hence — = - (1 - e ) 
d t k 


X ‘k\ t + V 


+ d . 


( 1 ) 

( 2 ) 


( 3 ) 


But when t = 0, x = 0; so d — 

Hence x = y- + ( e ~ k% “ 0- ^ 

k k 

Finally, we see from (3) that as t— > oo, v^> g/k, which is therefore 
called its terminal velocity. (And the same value can be found more simply 
from equation ( 1 ) if we suppose that the acceleration eventually tends to 
zero.) 
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Example 6 

The rate at which a tree takes in its 4 food ’ is proportional to the area of its 
rooting surface. The food is used 

(i) to maintain the tree — this requires an amount proportional to its 
volume, and 

(ii) for it to grow — this requires an amount proportional to the rate of 
increase of volume. 

If the complete tree, including roots, retains a constant shape show that 
its height h satisfies a differential equation of the form 


d h 
d 1 


+ 


Ah = B 


and that the tree will tend to a maximum height. 

A tree is planted when it is 2 m high and its initial rate of growth in 
height is 0.3 m per year. Its final height is 100 m. How long does it take to 
reach a height of 50 m? (m.e.i.) 


This example shows how a so-called 4 mathematical model’ can be set 
up to describe a given situation; whether or not the model is valid can then 
be tested from the accuracy of its predictions. 

Suppose that after time t the volume of the tree is V and the area of its 
rooting surface is S. Then, since the tree retains a constant shape, 

S = ah 2 , V = bh 3 (where a , b are constants). 


Also the rate at which it takes in its food is kS. 


dV 

kS = cV + d — (where £, r, d are constants) 

kah 2 = cbh 3 + d x 3 bh 2 — 
d t 


ka = bch + 3 bd 


. dh 
d t 


dh 

— T Ah — H. 
dt 




d^ 

When t = 0, h = 2 and — = 0.3, so 
dt 


0.3 + 2A = B. 


( 2 ) 


dh 

Also, as the tree nears its final height, A —»• 100 and — —> 0; so 


100 A = B. 

From (2) and (3), A = 0.3/98 and B = |f. 


( 3 ) 
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Hence 


d h 0.3 , 30 

— + — h — — 
d t 98 98 

dh 

98— = 0.3 (100 - h) 


_ 98 d h 

^ 100 - h 

=> -98 In (100 - h) 


0.3 dt 
0.3£ -f- A. 


But when t = 0, h = 2; so 


-98 In 98 = A 

- °- 3 ‘ = 98ln !o^T < 4 > 


Putting h = 50, 


0.3f = 98 In ft 
98 

t = — In 1.96 
0.3 

98 x 0.672 9 


0.3 


220 . 


So the tree would take just over 220 years to grow to a height of 50 m. 

Moreover, from (4) we can obtain a more general formula for its height h 
after t years. 


For In 


100 - h 
98 

100 - h 


98 


_0.3* 

“"98” 

0.3t/98 

100 - 98 e ~°- 3t ‘ 98 
100 - 98 £ -0 * 00306 f 
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Exercise 9.8b 


1 For each of the following equations, 

(a) illustrate the equation by means of a compass-needle diagram; 

( b ) find and illustrate its general solution: 


(0 


dy 

d* 

dy 

dx 



dy 

r -s d y 

= y; 

(«) -7- = -*y> 

ax 

(in) — 

dx 

2 

M d ? y. 

W 7- = -; 
dx x 

(vi) — 

= -r; 


2x 

y 


2 For each of the following families of curves, 

(a) eliminate the parameter c to form a differential equation which repre¬ 
sents the family (i.e., is true at every point of every curve); 

( b ) illustrate by means of a compass-needle diagram; 

(r) find the differential equation representing the orthogonal family; 

(d) find and illustrate the general solution of this equation: 

(?) y = cx\ (ii) y = x + c\ (iii) y = cx 2 ; 

{iv) y 2 = x + c; ( v ) y=ce~ x ; (vi) y=~r 

3 (i) Solve the differential equation 
dy 


dx 


= tan y cot x, 


given that, when x = \n, y = \n. 

(ii) Find the solution of the differential equation 
dy 

x — = sin y 
dx 

(for x > 0 ) which passes through the point ( 2 , ^n) 


(o.c.) 


(S.M.P. 


4 The size S of a population at time t satisfies approximately the differen- 
dkS 

tial equation — = kS , where k is a constant. Integrate this equation to 
find S as a function of t . 

The population numbered 32 000 in the year 1900 and had increased 
to 48 000 by 1970. Estimate what its size will be (correct to the nearest 
1 000 ) in the year 2000 . (s.m.p.) 


5 If a ship’s mooring rope is wrapped round a rough bollard, with co¬ 
efficient of friction )U, and is on the point of slipping, it can be shown that 
its tension T an angle 6 from its taut end is such that 
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Deduce that 
T = T 0 e~^ e , 

where T 0 is the tension from the ship. If fi = show that one complete 
turn will reduce this tension in the ratio 1 :e n (or approximately 1:23). 

6 If the rate of erosion of mass of a spherical pebble is proportional to its 
surface area and it is half-eroded after 100 years, how much longer will it 
last? 

dv 

7 Show that the acceleration — of a particle moving in a straight line 

d£ 

can be written, in terms of its velocity v and its displacement x from a point 
d^ 

of the line, in the form v -— 
ax 

At a distance x km from the centre of the Earth the gravitational accelera¬ 
tion in km s~ 2 is given by the formula c/x 2 where c = 4 x 10 5 . If a lunar 
vehicle 10 000 km from the centre of the Earth is moving directly away 
from it at a speed of 10 km s ” 1 , at what distance will its speed be half that 
value? (s.m.p.) 

8 A rumour is spreading through a large city at a rate which is propor¬ 
tional to the product of the numbers of those who have heard it and of 
those who have not heard it, so that if x is the fraction who have heard it 
after time t, 

^ = kx{ 1 -x). 
dt 

If initially a fraction c of the population has heard the rumour, deduce 
that 

c 

x —-i— 

c + (1 - c) e~ kt 

If 10% have heard the rumour at noon and another 10% by 3.00 p.m., 
find x as a function of t. What further proportion would you expect to have 
heard it by 6.00 p.m.? 

9 The inhabitants of a country are in two racial groups which do not 

mix reproductively. Each group’s population increases at a rate propor¬ 
tional to the population. At a certain stage 80% are of type A, type B will 
double in 50 years, but the whole population will take 100 years to double. 
How long will it take group A to double? How long will it take before the 
two groups are equal in numbers? (m.e.i.) 

10 Heat is supplied to an object at a constant rate Wa joules s“ 1 ; heat 
is lost to its surroundings at a rate of b W6 joules s ” 1 , where W is the number 
of joules needed to raise the temperature of the object by 1 °C, 6 °C is 
the difference in temperature between the object and its surroundings, 
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and a , b are constants. Obtain a differential equation to connect 8 with 
the time t. 

Use this mathematical model to solve the following problem. An electric 
kettle, open to a room temperature of 20 °G, reaches 70 °C in 3 min after 
being switched on, and reaches boiling-point in a further 3 min. Find the 
time taken to reach boiling-point from 20 °C if it is completely heat insula¬ 
ted. 

What deficiencies has the mathematical model in representing the actual 
physical situation? (m.e.i.) 

1 i The food energy taken in by a human body goes partly to increase the 
mass and partly to fulfil the requirements of the body; these daily require¬ 
ments are taken to be proportional to the mass M. The rate of increase of 
mass is proportional to the number of joules available for this. Write down 
a differential equation connecting M, the time t and the daily intake of 
joules f(f). 

A man’s mass is 100 kg; if he took in no energy he would reduce his 
weight by 10% in 10 days. How long would it take him to reduce by this 
amount if, instead, he took in exactly half the number of joules needed to 
keep his mass constant at 100 kg. (m.e.i.) 

12 A stone is thrown vertically upwards with velocity w, and air resistance 
is proportional to the square of its speed ( = kv 2 per unit mass). 

Find: 

(i) the differential equation connecting its velocity v and the height x it 
has then reached; 

(ii) x in terms of v; 

(in) its maximum height; 

(iv) its terminal (or limiting) velocity. 

13 A flask contains 10 dm 3 of a solution of water and a thoroughly dis¬ 
solved chemical. Each minute, 2 dm 3 of water flow into the flask and 
3 dm 3 of solution flow out of the flask. Write down an expression for the 
number of kilograms of chemical per cubic decimetre in the solution after 
t min. If there are x kg of chemical in the flask at this time, show that 

d* — 3x 
d£ 10 — t 

Find the general solution of this equation. If there were 1 kg of chemical 
in the flask at time t = 0 , find the mass of chemical in the flask after 5 min. 

(a.e.b.) 

14 The rate of increase of thickness of ice on a pond is inversely propor¬ 
tional to the thickness of ice already present. It is known that, when the 
thickness of the ice is x cm and the temperature of the air above the ice is 
— a °G, the rate is (a/14 400 x) cm s - h Form an appropriate differential 
equation, and hence show that, if the air temperature is — 10 °C, the time 
taken for the thickness to increase from 5 cm to 6 cm is a little more than 

2 h. (C.) 
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Linear equations of first order 

A particularly important class of differential equations consists of those 
which are linear , where y and its derivatives occur only in linear combina¬ 
tions even though their coefficients may be functions of x. So a linear 
equation of the first order can be written as 

+ g{x)y = h{x). ( 1 ) 


Example 7 



+ 2 xy = 1 . 


We quickly notice that this is a particularly convenient equation since it 
can be written as 

and, as the left-hand side is an exact derivative, the original equation is 
also called exact. 

Proceeding with the solution, we see that 


to the more general linear equation ( 1 ) we see that it 

h(x) 

/(*) 

Q ( 2 ) 

where P and Q are functions of x. 

This is usually taken as the standard form of the first-order linear equa¬ 
tion. Unfortunately (unlike Example 7) it is not usually exact, so we now 
investigate whether there is any way in which it can be made into an exact 
equation. 

If we multiply each side of (2) by another function, denoted by R , it 
becomes 

R ?r + PR y = Q R > 

ax 

and we see that the left-hand side would be our exact derivative if only it 
could be written as — (Ry ). 


x 2 y — x + A 

1 A 

y = - + -2 


If we now return 
can be written as 

dx f(x) y 


dy 

rl v 


+ Py = 
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But t {Ry) =R fx + ^ y - 

dy 

So i?——|- PRy is an exact derivative provided that 


d R 
dx 
d R 
~R 

In R 


= PR 


= P dx 


=1 


P dx 


o R = e S pdX ' 

As this function enables us to transform (2) into an exact equation, 
e SPdx is usually known as an integrating factor. By this means the equation 
becomes 

e iPix^L + p JPAXy = Qe SPix 

dx 

=> ^ (e SPix y) = Q 

and the problem is reduced to one of ordinary integration. 

Two further examples will make this clear: 


Example 8 
dy 

-r = x + y. 

dx 
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Unlike Example 7 , this is not an exact equation. But it can be written as 

dy 

3 — y = 

dx 

so that a possible integrating factor is 

e $Pdx = e i- Id* = 


Hence e x — — e x y — x e x 
dx 


j- (* x y) = xe ~ 

ax 


e~ x y = 


x e 


y = A e x — x — 1. 



d y 

sin x -—I- 2y cos x = 
dx 


1 . 


In standard form, this becomes 
d y 

-—|- (2 cot x) y = cosec x 
dx 
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and the necessary integrating factor is 

2 cot x dx __ ^2 In sin x __ ^lnsin 2 x __ g-^2 ^ 

So the equation becomes 
dw 

(sin 2 x) ——h (2 sin x cos x)y = sin x 


T" (y si ° 2 *) 

dx 


sin x 


y sin 2 x = — cos x + A 


cos x 


y = 


Exercise 9.8c 

1 Find the general solutions of the following equations: 


dy 

(.) 

x 

O-l Q- 

* l^ss 

It 

(Hi) x ~ — 2y = (x — 2 ) e x \ 
dx 

dy 

(tv) — + y = 

dx 


2 Find the solutions of the following equations such that y = 1 when 
x = 0: 

(i) (1 - Tjf - x y = x; (”) 0 + *) ^ = *(y + !)• 

3 The current i in a certain electric circuit satisfies the equation 
di 

-h 1 — sin t. 

d t 

If it is known that i = 0 when t = 0, find i in terms of t. To what does 
this approximate as t —> 00 ? 


4 A radioactive substance P decays and changes (without loss of mass) 
into a substance Q, which itself similarly changes into a third substance R. 
R suffers no further change. The masses of P, Q, and R present at time t are 
given by /?, q , and r respectively. The rates of change are such that 

f=-2p and ( 1 ) 

d t d t 

Show that 


d q 

di = 2p ~ q ' 

Initially (at time t = 0) there is 1 g of substance P and none of sub- 
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stance Q. Integrate equation ( 1 ) and hence show that q satisfies the dif¬ 
ferential equation 

< dt + q = 2e ~ 2 '- (2 > 


Show that (2) may be written in the form 

d V> = 2*-, 

and integrate to find q as a function of t. Hence prove that, at any sub¬ 
sequent time, there is never more than \ g of Q present. (s.m.p.) 


--dLftJL- 

} 


-VW-II-1 

A 

— -'X,- 


--X,- 


E q sin ojt 
(0 


E 0 sin cot 

(«) 


In each of the above circuits an alternating potential E 0 sin cot is applied 
and the current i satisfies the following equations (respectively), where 
R , L, C are constants, known as resistance, inductance, and capacitance. 

di .. d i 1 

(i) L -—h Ri = E 0 sin cot; (ii) R— -\- —i = coE 0 cos cot. 
dt d tC 


In each case, find: 

(a) the appropriate integrating factor; 

(b) the general solution for i, as a function of t; 

{c) the ultimate ‘steady’ alternating current (when t —> oo and any 
‘transient’ terms have faded away). 

[It may be assumed — or proved by repeated integration by parts — that 


I 


e kt sin cot d t = 


e‘ 


,kt 


and e kt cos cot d£ = 


k 2 + co 2 
e kt 

k 2 -f co 2 


{k sin cot + co cos cot) 


(co sin cot — k cos cot)~\. 


Linear equations with constant coefficients 

A particularly important (and fortunately simple) linear differential 
equation is one in which the coefficients of y and its derivatives are all 
constants. We shall consider the second-order equation 
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(i) 


d 2 y , , d y 


+ b— + cy = /(*) 


dx 2 dx 


where a, b, c are constants. 

The solution of such equations can be divided into two parts: 
(a) finding a general solution of the associated equation : 


d 2 y d y 

a d? + b Tx 


+ cy = 0 


( 2 ) 


in which f (#) has been replaced by zero. This solution is known as the 
complementary function (cf). 

(. b ) finding a particular solution of the original equation (1). This is called a 
particular integral (p.i.). 

These processes will be clear when we take particular examples, but the 
general method can be summarized as follows: 


(a) The complementary function 

To find a general solution of the associated equation 


d 2 y , , &y 
a dS +b Tx 


-h cy — 0 , 


( 1 ) 


we put y — e mx and observe that this is a solution provided that 

am 2 e mx + bm e mx + c e mx = 0 
o am 2 + bm + c = 0 . 

So if this equationf has roots a and /?, it follows that e** and e^ x are both 
solutions of equation ( 2 ). 


Moreover, 


if 

u = 


d u 

then 


dx 


d 2 u 

and 


dx 2 


A e ax T B e px (where A, B are any constants), 
Acc e« x + BP e Px 

Aa 2 e ax + Bp 2 f x . 


| The very important case when am 2 -f bm + c — 0 does not have any real roots will be 
considered in 10.5. Meanwhile, we see that if it has identical roots a, a, then 

u = A f* + B e™ = (A + B) «**, 

which effectively contains only one arbitrary constant A + B, rather than two independent 
constants A and B. But in this case it can quite easily be shown that x «“* is another solution, 
so that 

u = A e™ + Bx 

is also a solution, thus restoring two independent constants. 
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TT d 2 u d u 

Hence a + b -—V cu 
dx ax 

= a(Aoc 2 e ax + Bfi 2 e px ) + b{Aot e ax + Bfi e fix ) + c {A e ax + B e px ) 

= A (aot 2 + bot + c) e ax + B(afi 2 + bp + c) 

= 0 + 0 = 0 . 

So u = A e ax + B is also a solution of (2). This is therefore the com¬ 
plementary function. 


(b) A particular integral 

The discovery of a particular solution of the original equation is frequently a 
matter of trial and error, and we shall restrict ourselves to the simplest 
cases. But once we have found such a p.i. we can immediately combine it 
with the c.f. to provide a general solution of the original equation: 

For suppose that v(x) is a particular integral. 

d 2 v <\v 

Then a —« + b -—b cv = fix). 
dx 2 dx 


But the complementary function u(x) was such that 


2 u du 

a ~ 7~2 + ^ “i—b cu = 0 , 
dx dx 


Hence a —x (u + v) + b (u + v) + c {u + v) = f{x) 

dx d* 

and we see that u(x) + v(x) is a general solution of the original equation. 
So for such a linear equation, 


general solution = complementary function + particular integral 


Example io 

Find a general solution of the differential equation 

d2 y , d y R , , 

— + 37 ~ oy = x + 1. 


dx 


dx 


(a) Complementary function 
The associated equation is 
d 2 y dy 

— + = 0 


dx 


dx 
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and y = e™ will be a solution provided that 

m 2 e mx + me mx - 6 e mx = 0 
o m 2 + m — 6 = 0 

<=> (m — 2) (m + 3) = 0 
<=> m = 2 or —3. 

So the c.f. is 
y = Ae 2x + Be~ Sx . 


( b) Particular integral 

If we seek for y as a function of * such that 
d 2 y d y 

we naturally first look amongst simple linear functions of the form 
y = ax + b. 

Then ^ = a 
dx 


and 


So 


d 2 _y 

dx 2 


= 0 . 


d 2 y dy 

dx 2 dx 


6 y = x + 1 


^ — 3 ? 6 ■ 


<=> 0 + a — §(ax + b) = x + 1 

o a — —^ and # — 6 Z> = 1 => 

So a p.i. is y = — ^x — 3 ^ 

and a general solution of the original equation is 
y = Ae 2x + Be~ 3x -ix-^. 

We can clearly also apply this method to similar first-order equations. 
For instance, the equation of Example 8 can be written as 


Example n 


dy 

dx 


y = x 


and we can immediately obtain the necessary complementary function 
and particular integral: 


(a) Complementary function 

Put y = e mx in the associated equation 
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Then m e mx - e mx = 0 

=> m = 1 

so that the c.f. is y = A e x . 

(b) Particular integral 

First we try y = ax + b. This would be a p.i. provided that 

a — (ax + b) = x 
=> — ax+(a — b)=x 

=> a — — 1 , <2 — ^ — 0 

=> a = b = — 1. 

So a p.i. is y = — * — 1. 

Hence the general solution of the given equation is 
y = Ae x - x - 1. 


Example 12 

If an alternating potential is applied to a certain electric circuit then (using 
suitable units) the current x after time t is such that 

d 2 x dx 

—7 + 4— + 3x = sm t. 
d t 2 dt 

Find the general solution. 


[a) Complementary function 
The associated equation is 

d 2 * dx 

—2 + 4— + 3* = 0 
dt 2 dt 


and x = e mt is a solution provided that 

m 2 e mt + 4 m e mt + 3 e mt == 0 
o m 2 + 4m + 3=0 
<=> m = — 1 or — 3. 

So the c.f. is x = A e~ x + B e~ 3t . 


(b) Particular integral 

Perhaps we would first try to find a function of the type 
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x = a sin t 

which satisfies the original equation for all values of t. 

But this would entail 

— a sin t + cos t + 3a sin t = sin t 
=> 2 a sin t + cos £ = sin /. 


This identity is clearly impossible, so we now try 
x = a sin t + b cos /. 


dx 

In this case — = a cos £ — b sin £ 
d£ 


and 


d 2 x 

d 7 


— — a sin t — b cos t. 


So (— a sin t — b cos t) + 4 (a cos £ — £ sin f) + 3 (a sin / + b cos t) = sin t 
=> (2fl — 46) sin t + (4« + 26) cos t = sin t 
^ 2a - 46 = ll 


4a + 2b = 0 


^ - 10 5 " 


and we see that 

jc = Yo sin ^ | cos £ is a p.i. 


Hence a general solution is 
x = A e~ x + B e~ 3t + yo ( § i n t — 2 cos t). 

In a particular case, the constants A and B would usually be determined 
by the initial conditions, e.g., from one’s knowledge of the current x at 
t = 0 and how it was changing. But it is noticeable that both terms of the 
complementary function A e~ l + B e~ 2t tend to zero as t — > oo ; and this 
happens very rapidly, so that in every case the steady-state current after 
the initial surge (or other transient effect) is 

x « jo ( s i n t ~ 2 cos t). 


Exercise g.8d 

i Find the general solutions of: 
d 2 ?y dy 

(0 T 2 - 4 ;r + 3y = 0; 

dx dx 

(■v) 0 - 4 - 5 » - °; <»■> 

(..) S - 4 - 0; (,) 


d 2 y 
d? 
d 2 y 

dx 2 


4 y - 0 ; 



+ \y = 0. 
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2 Verify that the given expressions are particular integrals of the corres¬ 
ponding equations, and hence find their general solutions: 


d2 y 

dx 2 dx 

d 2 y 3 d y 

dx 2 dx 


+ 2 y 
+ 2 y 


(iii) 


d 2 x dx 

~y~2 3 — 

dt 2 d£ 


+ 2 x 


6 ; y = 3. 

x 2 ; y = |(2x 2 + 6 x + 7). 
x = 


(iv) 


d 2 x dx 

-7-9- + 3— + 2x = sin t\ 
dt 2 dt 


x = jo (sin £ — 3 cos t). 


3 For each of the following equations find the complementary function, 
a particular integral and the general solution: 


(0 

d 2 y 

_ 9 ^ 

dx 2 

dx 

(«) 

d 2 y 

dy 

+ 2-f 

dx 

dx 2 

(Hi) 

d 2 x 

dx 

dt 2 

dt 

{iv) 

d 2 e 

d e 

dt 2 

+ 3 3“ 
d£ 


— 3 y = x; 
-3 y = e~ x ; 


+ 2 x = sin t ; 


+ 20 = cos /. 


4 Find the solutions of the following equations which satisfy the given 
initial conditions: 


(0 

d 2 y d y 

dx 2 dx 

+ 2 y = 0 

; y = 3 

and i-4 
dx 

when x = 0 ; 

(ii) 

d 2 x 

d? =fa; 

x = 0 

dx 

and — = 
dt 

: 4 when t — 0 

5 

(in) 

d 2 e do 

dt 2 + dt 

- 20 = 4; 

0 = 0 

and 

d/ 

when t = 0 . 


5 The needle of a certain heavily damped instrument makes an angle 0 
with its zero position after time t , and it can be shown that 


d 2 0 d 0 

tt 5— 
d / 2 d£ 


+ 60 = 0 . 


Find an expression for 0 in terms of t if initially (i.e., when t — 0) the 
needle was at the zero position but moving at 2 rad s -1 . Hence find the 
maximum deflection of the needle and sketch the graph of 6 against t. 


6 
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If a potential E is applied to a simple circuit with inductance L , resistance R 
and capacitance C, it can be shown that the charge q on the capacitor after 
time t must satisfy the equation 





= E. 


Find the general solution of this equation for the circuit in which 
L = 5 x 10~ 3 (henry), R = 6 (ohm), C = 10“ 3 (farad) when the applied 
potential is 100 (volts), and show that the charge on the capacitor very soon 
becomes yq C. 


7 In Exercise 9.8c no 5 (?) and (??), (a) find the corresponding comple¬ 
mentary functions and appropriate particular integrals. 

Hence proceed to parts (b) and (c). 


9.9 Hyperbolic functions 


Earlier in this chapter we introduced the two functions e x and e~ x and we 
now proceed to consider their semi-difference and semi-sum. For reasons 
which will soon become clear, these are called sinh x and cosh x, the. 
hyperbolic sine and hyperbolic cosine (or, more familiarly, ‘shine 5 and ‘cosh’). 


So 


sinh x — \[e x — e x ) 
cosh x = \(e x + e~ x ) 


Firstly, we see that 

sinh ( — x) = -$( e ~ X ~ O = — sinh x 
and cosh (— x). = + e x ) = coshx; 

=> sinh x is an odd function 
and cosh x is an even function. 


This is also clear from considering the graphs of e x and e x , and of 
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sinh x and cosh x : 



The first similarity with the trigonometric functions is noticed if we find 
the derivatives of sinh x and cosh x. 

For — (sinh x) = {\(e x — e ~ x )} 

dx dx 

= \{e x *f e~ x ) = cosh x 

~~ (cosh x) = {\{e x + e~ x )} 

dx dx 

= ^(e x — e~ x ) ~ sinh x. 

Further, we notice that 

cosh 2 x = \{e x + e~ x ) 2 = \(e 2x + 2 + e~ 2x ) 

sinh 2 x = £( e x — e~ x ) 2 = i(e 2x — 2 + e~ 2x ). 

So cosh 2 x — sinh 2 x = 1 

and cosh 2 x + sinh 2 x = \{e 2x + e~ 2x ) 

= cosh 2x. 

Also sinh 2x = |-( e 2x — e~ 2x ) 

= U(e x ) 2 ' (O 2 } 

= - e' x ){e x + e~ x ) 

= 2 x {{e x - e~ x ) x {{e x + e~ x ) 

=> sinh 2x = 2 sinh x cosh x. 

Summarising these elementary properties, we see close similarities, as 
well as subtle differences, between the hyperbolic and the trigonometric 
functions, the only property of the latter without parallel being that of 
periodicity: 
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— (sinh x) — cosh x — (cosh x) = sinh x 
d# dx 

cosh 2 x — sinh 2 x = 1 

sinh 2x — 2 sinh x cosh x 

cosh 2x = cosh 2 x + sinh 2 x 


Lastly, and again by analogy with the trigonometric functions, we 
define 


sinh x 

tanh x =--— 

cosh x 


sech x = —-— 
cosh x 


cosh x 
coth x — 

sinh x 


cosech x = — 

sinh x 


Exercise 9.9a 

1 Show that the hyperbola 



can be represented parametrically by the hyperbolic functions 
x = a cosh 6 , y = b sinh 9. 

Find ~~ in terms of 6. 
dx 


2 Show that: 


(i) tanh x = 


- 1 


« 2 * + r 


(ii) tanh x is an odd function; 


(Hi) — (tanh *) = sech 2 x; 
d^ 

and sketch the graph of tanh x. 


3 Differentiate with respect to x: 

(i) sinh 2x; (ii) cosh^x; (Hi) #cosh%; 

(v) In cosh x\ ( vi) sechx; (vii) cosech x; 


(iv) In sinh x ; 
(viii) In tanh jx. 


4 Find the following integrals: 


(i) Jsinh^d^; (ii) Jcosh2xd^; (Hi) J 


sech 2 x dx ; 
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I 


iv) tanhxdx; (v) 


J 


cosh x dx ; 


I 


x sech 2 x dx. 


5 Calculate: 

(i) the area beneath the curve y — cosh x between x = — 1 and x = + 1; 

(ii) the volume formed when this area is rotated about the x-axis. 


6 Use Maclaurin series to find the power series for: 

(i) sinhx; (ii) cosh x. 

7 A bead is at rest on a straight smooth horizontal wire which is then 
made to rotate with constant angular velocity co about one of its ends. It can 
be shown that its distance r from the point of rotation will satisfy the 
equation 


Show: 

(i) that the general solution of this equation is r — A + B e~ m 
and (ii) that if initially r = a, then r = a cosh cot 

(Hi) that if the wire has rotated through an angle 8 , then 8 = cot and the 
polar equation of the bead’s path is r — a cosh 8. 


Inverse hyperbolic functions 

We see from a graph that the function sinh x sets up a 1-1 mapping of the 
real numbers R on to R, so that its inverse function is defined throughout R 
and we can write: 

y = sinh x o x = sinh -1 y. 
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h(e x - O 

0 

y + \/(y 2 + 1 ) (since e x > 0 ) 

In [y + J{y 2 + 1 )]. 

in [y + 7 (y 2 + !)]; 

and just as sinh is related to the exponential function, so sinh -1 is related 
to the logarithmic function. 

The function cosh x is slightly more awkward, for it maps all x e R on 
to the range y > 1 , and each of the numbers of the range (except y = 1 ) 
arises from two equal and opposite members of the domain. 


Now y = sinh x = 
=> e 2x - 2ye x - \ = 
=> e x = 

So sinh -1 y = 



If, however, we restrict the domain to x ^ 0, we can obtain an inverse 
function, and write 

y = coshx (x ^ 0) => x = cosh -1 y (y ^ 1 ). 

Now y = cosh x = j(e x + e~ x ) 

=> e 2x - 2ye x + \ = 0 
=> (e x ) 2 - 2ye x + 1 = 0 

=* e = y ± s/(y 2 - i) 

=> x = In \_y ± J(y 2 - 1 )]. 
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Bo ' 1 ~'t 1 -* 2 ~ l)= „ + J,’- f) 

= -In [y + V(y 2 - 1)]. 

Hence x = + ln \_y 4- >/(z/ 2 — !)]• 

But x ^ 0, so x = +ln [y + ^(z / 2 — 1)] 

=> cosh " 1 y = ln[y + y/(y 2 - 1 )]. 

We can now write these inverse hyperbolic functions as 

sinh " 1 x = In [x + \/{x 2 + 1 )] 
cosh " 1 x = In [x + yj(x 2 — 1 )] 


Their main importance is in the calculation of two standard integrals 
dx 


:,) l 


(0 I- 


V {x 2 + 1) 

dx 




/• 

* 


y/(X 2 + 1 ) 

put x = sinh u => dx — cosh u d u. 


Then 


f dx f cosh u d u 

J V(x 2 + 1) J V^inh 2 « + !) 


cosh u d u 
cosh u 

= I d u 


= u + c 
= sinh " 1 x + c. 


(ii) In 


I 


dx 


V (* 2 - 1 ) 


put x = cosh u 
Then 


f d * f 

J V (* 2 -T J 

■J 


dx = sinh u d u. 
sinh u d u 


y/( cosh 2 u — 1 ) 
sinh u du 


sinh u 
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= J d “ 

= u ~h c 
= cosh - 1 x +. c. 


Hence 


dx 

^/(x 2 4 - 1 ) 
dx 

s/(x 2 - 1 ) 


sinh 1 x + c 
cosh - 1 x + c 


Exercise 9.9b 

1 Express as logarithms: 

(i) sinh -1 f; (ii) cosh -1 2 ; (Hi) tanh -1 x. 

2 Find the following integrals: 


(0 
{iv) 
( 1 vii 
{ix) 


dx 


yj ( X 2 + 4) ’ 

dx 


m h ""’Itis 

”> I 


V(X 2 + 2x) : 


f dx 

J V(9* 2 - 4) ; {V ' 

ii) J* sinh -1 xdx; (viii) j* cosh -1 xdx; 


2 + 9)’ 

dx 


V(x 2 + 2x + 2)’ 


J V(* 2 

3 Evaluate: 

» iwh 


+ 1 ) dx; (x) 


l ^ - 


4) dx. 


+ 1 )’ 


(n) 


dx 


V (x 2 - 4) : 


(Hi) J V(* 2 - !) d*; (iv) J y/( 

dx 


4 ( i ) Find 


1 + 4x 2 ) dx. 


by two methods: the use of partial fractions, and 


the substitution x = tanh u [see also no. 1 (m)]. 

(ii) If a particle falls from rest through a medium whose resistance is 
proportional to the square of the velocity, it can be shown (in the usual 
notation) that 

dv dv 9 

— = v — = g — kv . 
d t dx 

Hence find v in terms of x; v in terms of t; and the limiting value of the 
velocity. 
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9.10 Lengths of curves and areas of surfaces 

We have already used the method of integration to find both plane areas 
bounded by curves and also volumes bounded by surfaces of revolution, 
and we now complete our account by a short investigation into the lengths 
of such curves and the areas of such surfaces. A refined treatment of these 
topics is difficult as it is necessary to seek protection against the curve 
being too crinkly or the surface being too corrugated, so we shall restrict 
ourselves to an introductory look at smooth curves and at the surfaces 
formed by their revolution. 


The length of a curve 



Suppose that a fixed point A is taken on the given curve and that the arc- 
length AP from A to a point P (x, y) of the curve is j. 

Let us further suppose that P' (x + 8x,y + Sy) is a neighbouring point 
of the curve and that PP' = Ss. 

Then (<5j) 2 « (8x) 2 + ( by) 2 (1) 



So it appears (and, provided the curve is smooth, this can be fully justified) 
that: 



Alternatively, if x and y are known in terms of a parameter t , 
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Example i 

Find an expression in terms of x for the length s of the curve y = c cosh 
measured from its lowest point. (It can be shown that this curve, known 
as a catenary , is the curve in which a uniform flexible chain hangs when 
suspended between two points. The constant c = T 0 /w , where T 0 is the 
tension at its lowest point and w is its weight per unit length.) 



y = c cosh - 
c 
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(Hence, further, we see that 

y 2 — s 2 = c 2 cosh 2 - c 2 sinh 2 - = c 2 

c c 

=> y 2 = + c 2 .) 

Example 2 

A wheel of radius a is rolling in a straight line along a horizontal road. Find 
the total distance travelled by a point on its circumference between succes¬ 
sive contacts with the road. 



Suppose that the point P was initially in contact with the road at O, but 
that the wheel has since rolled through an angle 9. If M is its new point of 
contact, since the wheel has been rolling, 

OM = PM = a9. 

Hence x = a9 — a sin 6 
and y = a — a cos 9. 

So the curve traced by P (known as a cycloid) can be expressed parametric¬ 
ally as 

x = a(9 — sin 9), y = a( 1 — cos 9 ), 

and we need to find the length of this curve between the points 9 = 0 and 
9 = 2n. 
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r 2n 

= a I 7(2 — 2 cos 6) d 6 
7(4 sin 2 d6 
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= a 


= 2 a 


sin 40 d0 


= 2a[-2cos|0]f 
= 2a[ —2( — 1) + 2(1)] 


s — 8 a. 


Exercise 9.10a 

1 Sketch the following curves and find their lengths between the given 
points: 

(i) y = x 312 , from (0, 0) to (4, 8); 

(if) y = In sec x, from x = 0 to 

(in) x = 2 at 3 , y = 3at 2 , from t — 0 to t = 1; 

(iv) x = at 2 , y = 2at , from t = 0 to £ = 1. 

2 Sketch the astroid 

x = a cos 3 y = a sin 3 t 
and find its total length. 


The area of a surface of revolution 


We begin by investigating the simplest case, of a conical surface being 
formed by the revolution of a straight line. Now if a complete cone of 
base-radius r and slant-length l is cut along a generator, it can be un¬ 
wrapped to form a circular sector of radius l and arc 2nr. 
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We now proceed to consider the section of a conical surface whose end- 
faces have radii R , r, and whose slant-length is /: 



If the semi-vertical angle of the cone is a, then the area of its curved surface 


= nR(R cosec a) — nr(r cosec a) 
= n(R + r)(R — r ) cosec a 

= 7r(/? t) l 

= 2»^/. 


Hence 


Area of curved surface of section 
= average circumference x slant length 


We can now move to the case of a surface formed by revolving a curve 
about 0*: 




If S denotes the area of surface generated, we see from consideration of a 
small element that 


SS « 2n{y + \ by) bs « 2ny bs 
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Example 3 

Find the area of the cap cut off by the plane x — a when the parabola 
y 2 = 4<2x is rotated about Ox. 





Investigate the surface area of the portion of a sphere between two parallel 
nlanes. 
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Suppose that we need to find the area between two circles whose declina¬ 
tions from Ox are a, /? respectively. 

Required area = 2ny d^ 

CP 

= 2na sin 6 x a d6 

Jot 

— 2na 2 j* sin 9 d 6 
= 2na 2 \_ — cos 9~\ P 

CL 

= 2na 2 (cos a — cos /?). 

Now if t is the thickness of this portion, t = a (cos a — cos /?) 

=> Required area = 2nat, 

and so is dependent only upon the distance between the parallel planes and 
not upon their position. 



We can also consider the circumscribing cylinder and immediately see 
that the corresponding area on the cylinder is also 27 tat, so we obtain 
Archimedes’ theorem']* that the area of a ‘zone’ on a sphere is equal to the 
corresponding zone on the circumscribing cylinder (or, in other words, 
that if a spherical hard-boiled egg were put through an egg-slicer whose 
wires were all equidistant, each portion would have the same area of 
curved surface!). 


Exercise 9.10b 

1 Find the areas of the surfaces of revolution formed by rotating 

(i) the catenary y = c cosh (x/c) about the x-axis, from x = 0 to x = c\ 

(it) the parabola y 2 = 4ax about the x-axis, from x = 0 to x — a\ 

(Hi) the astroid x = a cos 3 0, y = a sin 3 6 about Ox; 

f Archimedes of Syracuse (287-212 bc), who requested that this diagram be inscribed on 
the splendid tomb which was erected for him by the Romans. 
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(iv) the cycloid x = a(9 — sin 9), y = a( 1 — cos 9) about Ox, from 
9 = 0 to 2n. 

9.11 Areas and lengths in polar coordinates 

If we are given the equation of a curve in Cartesian coordinates, we have 
seen how the inclination 1 j/ of its tangent to Ox, the area A beneath the 
curve and its length s can all be derived from an examination of the dia¬ 
grams 



from which we immediately see that 

5A « y <3*; tan * « (Ss) 2 * (<5*) 2 + (Sy) 2 



Now in polar coordinates, instead of the inclination ij/ which the tangent 
makes with Ox, it is usual to consider its inclination <fi to the radius (or, if 
preferred, its angle of trail ): 
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So, just as in Cartesian coordinates, we can find formulae for A , </>, and j 
by considering the figures formed as P(r, 9) moves along the curve to its 
neighbouring position P'(r + <3r, 9 + 59 ): 



From the second figure, it is clear that 

5A « AOPP' = ^r(r + Sr) sin 59 
=> 5A & \r 2 59. 

Furthermore, since PN has been drawn perpendicular to OP' it follows 
that 

ON = r cos 59 ~ r 
and PN = r sin 59 « r 59. 

So in the elemental AP'PN, 

PN = r 59 , P'N « Sr, L NPT « <£. 

r 59 

Hence tan (p « —— 
dr 

and (&) 2 « (Sr) 2 + (r S0) 2 . 

Summarising these three, we see that 
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Example i 

Ifr = e 6cota (where a is a constant), find A, </), and s in terms of r (measuring 
A and j from the point where 0 = 0). 

(i) A = j* ^r 2 d0 

= ±e 29coi «d e 

Jo 

“^20cota “ 6 

4 cot a 0 

= \ tan a (^ 20cota — 1 ) 

= i(r 2 — 1 ) tan a. 

/ \ , d e 

(ii ) tan <p = r — 

dr 

r _ e 9cot « 

~ dr/d9 = cot a / cota 
=> tan </> = tan a 
=> cf) — a. 


Hence (f ), the angle of trail, is constant, so that the curve is known as an 
equiangular spiral: 
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Exercise 9.11 

1 Sketch each of the following curves, finding also their total lengths, 
their total areas, and (p in terms of 0 . 

(i ) r — a cos 0 ; 

(ii) r = a( 1 + cos 0 ), a cardioid. 

2 Sketch the lemniscate r 2 = a 2 sin 20. Find its total area and also (p in 
terms of 0 . 

3 Sketch the trefoil r 2 = a 2 sin 30 and find the area of each petal. 

4 Sketch the Archimedean spiral r = ad. Find the area of the sector and the 
length of arc between the points where 0 = 0 and 0 = 1 . 


Miscellaneous problems 9 


1 (i) Calculate (1 + 1 jn) n 

for n = 1,2, 3, 5, 10, 100. 

Does it appear to tend to a limit as n —> 00 ? 


(ii) If n— 1//?, show that 


l n [ 1 + I) = ln t 1 + h ) = In (1 + h) — In 1 


Letting n —► 00 (and so h —> 0), show that 


iv 


ln ( 1 H— ) —> 1 , and hence that ( 1 H— 
n) \ n 


IV 


(Hi) Prove similarly that, as n —► 00, 

1 + 


2 By the method of two staircases (4.2), show that 

*i ° 6 d* 1q6 \ io 6 -i j 

— lies between V - and Y — 

Ji x r = 2 r r =i r 

Hence show that the sum of the reciprocals of the first million integers 
lies between 13.81 and 14.82. 


3 By considering the graph of \/x or otherwise, show that 
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1 1/11 

In n — In (n — 1 ) > - and In n — In (n — 1 ) < — I —I-- 

n 2\n n — \ 

where n is an integer greater than 1 . 

The function/ (n) is defined by 

/(«) = 1 + ^ + l+- h--lnn. 

2 3 n 

Show that f(n) decreases as n increases and that f(n) — — increases as n 

2 n 

increases. Deduce that/ (n) tends to a finite limit as n tends to infinity. 

(c.s.) 


4 Show that 

n f*n +1 

In x dx < ^ In r < 

Jl r— 1 Jl 


In x d^ 


and deduce that the geometric mean of the first n positive integers is 
approximately n/e. (c.s.) 

5 ( i) Investigate the behaviour as x —► oo of the ratios 
In x In x In x 

~ 9 7^’ T ^' 


(ii) Show that if t > 1, 

1 1 

l < ~jt 

and hence that 

In x < 2{yjx — 1 ) 
1 1 

y/x X 


In x 

and - < 2 

x 


Deduce that as x —> oo, 

!^-o. 

X 

(Hi) Show that 

In x In 1 00 ^x 

-= 1 000 _ 

i 00 ^J X i °°o/x 

and use (ii) to deduce its limit as x —► oo. 

(iv) More generally, show that as x —> oo, In x —► oo more slowly than 
any positive power (however small) of x. 
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6 (i) Investigate the behaviour as x —► oo of 


e 


X 


X 


1ooo* 

X 


(ii ) Show that 



and use the result of no. 5 (ii) to show that as x —> 00, 

1 ** 

In-► 00 

x 

e x 

and hence that- > 00 . 

x 

(iii) Deduce that as x —> 00, 

** 

7000- 00 • 

(it;) More generally, show that as x —> + 00 , —> +00 faster than any 
positive power of x; and that e~ x —> 0 faster than any negative power of x. 


7 ^ 


AW 


[ X fe~'At 


show that I n (x) = —x n e x + nl n _ 1 (x). 

if/.( *)—►/„ as x —> +00, use the result of no. 6 to show that 

4 = nI n- 1 

and hence that, if w e Z + , 

/„ = «! 

8 (i) Sketch the graph of the function and illustrate the integral 



Can you calculate this integral ? 

(ii) Illustrate by a surface the function of two variables f (x, y), where 
f(x,y) = e~ x2 ~y 2 . 


(iii) Show that the volume under this surface can be represented in Car¬ 
tesian coordinates as 


limEE *-* 2 -' 2 Sx by — 



e x2 yl dx d y 





( 2 ) 
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then a n = - 
71 


f (*) sin nx dx. 


Assuming that the function 

f(x) = x(n — x), (0 ^ x ^ n) 

can be expressed as an infinite series^ 

00 

£ a n sin nx , 

1 

and that the coefficients are still given by the formula (2), show that in 
this case 

8 

a 2m -0, a 2m+1 - b(2jb + 1)3 

and hence sum the series 


1 1 

F + 5" 3 7 3 


(c.s.) 


10 In a population of N rats, n initially have a disease, the rest have not 
yet caught it. If a rat recovers from the disease it becomes immune. At a 
later time, t , the population is therefore composed of 1(f) rats ill with the 
disease, D ( t ) dead rats, R ( t) rats who have already recovered and S ( t) who 
are still susceptible. 

It is observed that at any stage 




dS 

d£ 


= -cSI 


where <2, b , and c are constants, and N — n > (a + b)/c. 

Prove that / increases until S falls to (a + b)/c, and then decreases. Prove 
also that the maximum value of / is 


a + b ec(N — n ) 

N -In- ; — 

c a + b 


(o.s.) 


11 Two tribes live on a desert island. Each tribe breeds only among itself 

and, in isolation, the population of each would increase at the same constant 
rate. But each tribe is eaten (by the other) at a rate equal to a constant, b, 
times the population of the other tribe. At a certain time the total popula¬ 
tion of both tribes together is N 0 . Investigate how the total population 
varies with time subsequently. (o.s.) 

12 James: ‘n is the most important constant in mathematics.’ 

John: c No, e is.’ 

Continue the discussion. (c.s.) 


*|* A typical example of a Fourier series , so called after their discoverer, J. B. J. Fourier (1768 
1830). 
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Complex numbers 


‘I met a man recently who told me that, so far from believing in the 
square root of minus one, he did not even believe in minus one. This is at 
any rate a consistent attitude. There are certainly many people who 
regard yj 2 as something perfectly obvious, but jib at yj — 1. This is because 
they think that they can visualise the former as something in physical 
space, but not the latter. Actually, yj — 1 is a much simpler concept.’ 

E. C. Titchmarsh, Mathematics for the General Reader. 


io.i Introduction 

The problem of complex numbers arises immediately in attempts to solve 
such quadratic equations as x 2 + 6x +13 = 0 => * = —3 + ^/(9 — 13), 
which is seemingly impossible since there does not apparently exist a 
number whose square is —4. Understandably, many people give up at this 
point and say in desperation that this quadratic has no solution, meaning 
‘has no solution among the reals'. But a pupil who had learned to solve 
quadratic equations by factorisation might equally well say that the 
equation x 2 — 2x — 4 = 0 cannot be solved, on the grounds that he 
cannot factorise the left-hand side. In fact, he would find, a little later in his 
mathematical education, that x = 1 + yj5 and 1 — yj 5 are solutions of 
the latter equation, and probably feel quite happy about the existence of 
the numbers 1+^5 and 1 — yj5. When at the earlier stage he had given 
up in his attempt to solve x 2 — 2x — 4 = 0 because it would not factorise, 
what he really meant was that it had no solutions among the rationals , because 
he was attempting to find rational factors of x 2 — 2x — 4. In fact, of course, 
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it does ‘factorise 5 , as (x — 1 + y/$)(x — 1 ~ -y/5), but only with the use 
of irrationals. 



2 10 12 3 


Now it is true, as Professor Titchmarsh implied in the quotation above, 
that yj 5 is a far more complicated concept than yj — 1, and we have already 
shown, in 1.5, that such numbers as y/5 are irrational in the sense that they 
cannot be expressed as the ratio of integers. Yet most people are happier 
with yj5 than with yj — 1. Why? Perhaps simply because yjb can be seen , 
with a place in the real number line ? By contrast, y' — 1 and the roots 
of x 2 4 6x 4 13 = 0 do not appear on the number line, and so it is 
tempting to suppose they do not exist. 

Now the mathematician’s attitude at this stage is to say ‘ It would be a 
great pity if we had for ever to suffer the irregularity of having some 
quadratic equations with two roots and others with none. How much better 
if all quadratics were to have two roots! We shall not let an equation like 
jv 2 + 6 jv +13 = 0 beat us! Let us invent a “number” yj — 1, which we 
shall call z, and suppose that it obeys all the usual laws of algebra (which 
will be stated in full in 14.1)’. Upon this bold hypothesis we should then 
say that the roots of* 2 4- 6* 4 13 = Owere — 3 4 yj — 4 = —3 4- 2^/— 1 
= —3 4 2 i. Notice that the word 4 number 5 applied to yj — 1 is in inverted 
commas, because we must not think of it in the same terms as we have 
thought of numbers hitherto, as a length or a measure. Rather is it a number 
in the sense that it behaves like ordinary numbers, and for some time we 
may use i without being too scrupulous about its meaning. This is what 
happened historically: for centuries mathematicians ‘played with i\ 
treating it as an ‘ordinary 5 number and using it to obtain extremely 
valuable results. For example, 

(a 2 4- b 2 )(c 2 4- d 2 ) = (a 2 — i 2 b 2 )(c 2 — i 2 d 2 ) 

= (a 4- ib)(a — ib) (c 4- id)(c — id) 

= (a 4- ib)(c 4- id) x (a — ib)(c — id) 

= (ac 4- i 2 bd 4- ibc 4 iad) (ac 4- i 2 bd — ibc — iad) 

= [ac — bd + i[bc 4 ad)~\[ac — bd — i(bc + ad)~\ 

— (ac — bd) 2 + (be 4 ad) 2 . 

The final result is quite valid, even though the working involved the use 
of the ‘invented 5 number i (the word ‘imaginary 5 is a somewhat unfor¬ 
tunate choice). And in many other situations, mathematicians have found 
that perfectly respectable results can be derived — often more shortly — 
by the use of z, and this provides strong encouragement for belief in this 
extension of the number system resulting from such a bold hypothesis. 
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A word of warning is necessary about bold hypotheses. Sometimes 
mathematicians have to accept defeat. For example, the equation ax = b 
always has a solution, provided a ^ 0. What a pity, exclaims the true 
mathematician, that this exception should have to be tolerated; that there 
should be just one number 0 which does not possess a reciprocal. Let us 
therefore invent a new number, oo, such that oox0=l=0xoo. 

This time, unfortunately, the bold hypothesis must be rejected. For, 
starting with 0 x 1 = 0 x 2 and multiplying both sides on the left 

by oo, we have oo x (0 x 1) = oo x (0 x 2) 

=> (oo x 0) x 1 = (oo x 0) x 2 (associativity of 

multiplication) 

=> 1 x 1 = 1 x 2 (since oo x 0 = 1, by 

definition) 

=> 1 = 2, a contradiction. 

So we have run into trouble, and the assumption, that a reciprocal for 0 
can be dreamed up, is untenable. 

However, it can be shown that no such inconsistency arises in the case 
of the invention of i, and that everything proceeds happily. Nevertheless, a 
rigorous justification of the invention must be provided at some stage. 
Historically, this came early in the nineteenth century after i had already 
had a long and useful innings. For as early as the sixteenth century mathe¬ 
maticians were experimenting with the square root of minus one, and had 
observed that consistent and acceptable results could be obtained, though 
they were unable to justify these procedures. Later mathematicians such 
as Euler (1707-83), continuing the work of John Bernoulli (1667-1748) 
and de Moivre (1667-1754) developed complex numbers as powerful 
mathematical tools; whilst, at the end of the eighteenth century, Wessel 
and Argand gave their celebrated geometrical interpretation. But it was 
not till the beginning of the nineteenth century that Gauss (1777-1855) 
laid a sound logical foundation for complex numbers, and then proceeded 
to develop the theory of functions of a complex variable. 

Now the invention of i to provide solutions for all quadratic equations 
may possibly give the impression that further inventions may be necessary 
to solve cubic equations, and possibly that even more weird and wonderful 
numbers might have to be dreamed up to deal with equations of higher 
degree; and what further miracles of man’s genius would be needed to 
assign meanings to such expressions as (2 — i) 1 and >f[sin (5 — 2i) 2/3 ]? 
Happily, such further inventions are not necessary; the extension of the 
number system by the single step of adjoining the square root of — 1 
enables all these situations to be covered! The roots of equations of any 
degree of complication, and numbers of any degree of "complexity 5 , can 
be expressed in the same simple form, x + iy , where x and y are real, just 
like the roots of the quadratic equation x 2 + 6* + 13 = 0 that we 
examined at the outset. It is this remarkable property of playing a unifying 
role in mathematics that endows i with such interest and importance. 

Our work in the present chapter will therefore rest on the assumption 
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of a number i satisfying i 2 = — 1, and we define a ‘complex number’ to 
be the sum of two parts: 

z = x + iy , 

where x and y are real, x is called the ‘real part’ of the complex number, 
and the second term, iy , is the so-called ‘imaginary part’. We also write: 


* = R ( z )> iy = /(*)• 

In the manipulation of complex numbers, we simply assume that the 
symbol i will behave exactly as an ordinary number, and that in any 
manipulations i 2 can be replaced by — 1. 

We observe immediately that the sum and difference of two complex 
numbers are also complex numbers; or, as we shall say in 14.2, the set of 
complex numbers C = {x + iy:x,yeR} is closed under addition and sub¬ 
traction. For ifzj = + iy x and z 2 = x 2 4- iy 2 , 


then z l 4 z 2 = x x + iy 1 4- x 2 4- iy 2 

= Oj + x 2 ) 4- i(y i + y 2 ), 

which is also a complex number; and similarly for subtraction. It is less 
obvious that the set is closed under multiplication and division. 


However z x z 2 — (x t 4 - iyi)(x 2 + fi/2) = {x x x 2 — y\y%) + *(^1^2 + * 2 #i)> 


while 


£4 = *1 + 1 = (*i + iy 1 ) (‘^2 - ^2) 
z 2 * 2 4 iy 2 (x 2 4- iy 2 )(* 2 - iy 2 ) 

_ *ix 2 4- y x y 2 .x 2 y x - *jy 2 

2 2 1 ^2 2 9 

x 2 4- y 2 x 2 4- y 2 


both of which are in the form x 4- iy. Thus the repeated processes of 
addition, subtraction, multiplication and division by numbers other than 
zero will lead every time to another complex number, and the system is 
self-contained. 

Moreover, we note that two complex numbers are equal if and only if 
their real and imaginary parts are separately equal: 

For z 1 = z 2 => x t - x 2 = i{y 2 - y,) 

=> (xj - x 2 ) 2 = ~{y 2 - ytf 
=> (x, - x 2 ) 2 + (y t - y 2 ) 2 = 0 
=> x x = x 2 and y x = y 2 


(since each of the perfect squares of real numbers must be positive or zero). 
The converse is trivial; and we have also the corollary that 

z = 0 x = 0 and y = 0. 
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Example 

In each of the following cases express the given number in the form x + iy : 
7 — 3i 1 — 3i 1 - 3 i 1 - 3 i 


(0 


(2 - 4 ?) 2 “ 4 - 16* + 16? 2 _ —16* - 12 _ -4(3 + 4?) 

(7 - 3*)(3 - 4 i) 

~ -4(3 + 4?) (3 - 4?) 

(multiplying top and bottom by 3 — 4? in order to obtain 
a real denominator) 

21 - 37* + 12 ? 2 9 - 37? 


-4(9 - 16? 2 
(?•?') (1 + i)~ 3 - (1 - ?')- 3 = - 


-100 

1 


= -0.09 + 0.37?'. 

1 


1 + 3? 4 - 3 ? 2 + ? 3 1 — 3?' 4” 3 ? 2 — ? 3 

1 1 


-2 + 2 ? —2 — 2 * 

1/1 1 


2 \1 + *' 1 - i / 

11 -?-!-? - 2 ? 


— i. 


(???) 


x + iy 


2(1 +i)(l-i) 4 

(x + iy) {x - 1 - ty) 


z — 1 x + iy — 1 (x — 1 + iy){x — 1 — iy) 
x 2 + y 2 — x — iy x 2 + y 2 — x 


(x - l ) 2 +y 2 
or we might proceed: 


+ 


z 1 

= 1 + - 7 = 1 + 


(x - l ) 2 + y 2 (x - l ) 2 + y 2 ’ 

x - l - iy 


Z — 1 


z — 1 


(x — 1 4- iy) (jc — 1 — iy) 


etc. 


Exercise io.i 

i Express the following in the form x + iy : 
(?') (-1 - ?') + (-2 - 5?'); 

(?'?') (4 - 3?') - (6 + 5?'); 

(Hi) 5?(1 — 2?); 

(iv) (4 + 30(2-i); 

(0 (7 — 20(—1 - 30; 

(«) (1 - 2 ?) 2 , (1 - 2 ?) 4 , (1 - 2 ?) 8 ; 
(vii) (2 — 3?') 2 — (4 + ? ) 2 ; 

(?????) (1 + 2 ?') 3 ; 

(?x) (5 - 6?')(5 + 6?'); 

(x) (7-0(7 + 0; 
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(xi) (x — i) 5 (x e R ); 

(xii) (x + iy) 6 (, x, y € R ); 

(xiii ) Use the result of (*) to simplify-and -- 

7 — i 7 + ? 


2 Express in the form x + iy : 


(0 


1 

2 + V 



1 + 

3 - 2 V 


M 


i + 1 
i - r 


{iii) (i-2) (1 — 3i) ’ 
(vi) (1 + 2i)~ 3 


(vii) 


(2 - Q 2 (1 +Q , 
1 - 3 i “ ’ 


(urn) 


1 

cos 6 — i sin 9 


3 Find integers p , q such that (3 + 7/) (/> + is purely imaginary. 

4 If z = x + iy (x, y e R), express in the form X + iY: 

(i) z 3 ; (ii) z~ 2 \ (Hi) z H- 

z 


5 Show that (4 — 5i ) (2 + 3 i) = 23 + 27, and deduce factors of 23 — 2 i. 
Hence show that (4 2 + 5 2 )(2 2 + 3 2 ) = 23 2 4 - 2 2 . 

Use a similar method to express (4 2 + 7 2 )(9 2 + 5 2 ) as the sum of two 
squares. 

z z — 1 5 

6 Solve the equation for z: -: H-:— = -:* 

3 4 4 1 5 1 3 — 4 x 

7 Find real values of x and y such that (1 4-3 i)x + (2 — 5i)y 4 - 2 i = 0. 

8 Evaluate yj( 1 4 - i) and ^J (5 — 127) by letting the result be x + iy in 
each case. 

9 Verify that x = 2, y = — 1 satisfy the simultaneous equations 

* 3 — 3 xy 2 = 2, 3 x 2 y — y 3 = —11. 

Hence find the cube root of 2 — 11 1 . 

10 Find real values of x and y to satisfy 

x y —5 + 6 / 

1 - 2 i + 3 - 2 i = 1 + 8 ; ’ 

11 Solve the equations: 

(/) z 2 + 4z + 29 = 0; (ii) 4z 2 - 12z + 25 = 0; 

(777) z 2 + 2/z + 1 =0; (iv) z 2 + iz = 2; 

0) z 3 + 10z 2 + 37z + 42 = 0; (vi) (2 + i)z 2 - z + (2 - i) = 0. 

12 Find the quadratic equations whose roots are: 

(i) 3 + 75, 3 - 75; (ii) -3 + i, -3 - i; 

(iii) 2 + 7 3i, 2 — 7 ^*! (* y ) — 1 + 2 i. ( 
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13 Find the sum and the product of the roots z x , z 2 of the equations 

(t) z 2 - 5z + 7 = 0; (u) z 2 + z + 7 = 0. 

Obtain in each case the value of z\ + z 2 . What do you notice, and how 
do you account for it ? 

14 Sum the series: 

(0 1 + i + i 2 + i 3 + • • • + i"; 

(ii) 1 - 2 i + 4i 2 - 8 i 3 + ••• + (- 20 " -1 ; 

(iii) 1 + 2i + 3i 2 + 4i 3 + • • • + n?~\ 

x ^ 

15 Express -7, - in partial fractions in the form- 1 - 

x 2 + 1 r x + { x - i 

Repeat for the expression —--* 

x + 1 


10.2 The Argand diagram 

We have seen that any complex number consists essentially of a pair of real 
numbers: z = x + iy, where x and y are real. In view of this aspect of a 
complex number as an ordered pair of real numbers, it is natural that we 
should choose to represent them by the points in a plane using a pair of 
coordinate axes Ox, O y, and by assigning the complex number x •+- iy to 
the point whose coordinates are (x, y). It is clear that this sets up a 1, 1 
correspondence between the complex numbers and the points of the 
Cartesian plane. For example, 

i is represented by the point ( 0 , 1 ), 

-2 + 3i (-2, 3 ), 

-5i (0, -5), 

-4 (-4,0); 

and conversely, the point ( 1 , —\) represents the complex number 1 — \i, 
and so on. This method of representing the complex numbers is referred 
to as the Argand diagram after the French mathematician J. R. Argand 
( 1768 - 1822 ). Ox and O y are called the ‘real’ and ‘imaginary’ axes, for 
those complex numbers of the form x + 0 i, i.e., the real numbers, are 
represented by points of the x-axis, whose role is thus seen to be simply 
that of the real number line; while those complex numbers whose ‘real 
parts ’ are zero appear in the Argand diagram on the ^-axis, and are some¬ 
times loosely referred to as ‘purely imaginary’, among them being the 
number i itself. 

We have already emphasised that it is a mistake to try and think of i as 
a number in the sense of a ‘magnitude’, and it is only after the Argand 
diagram has been established that it becomes possible to acquire any sort of 
visual insight into the nature of i. 
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If i is regarded as a geometric operator, or an instruction to perform a 
certain geometrical transformation, then the equation i 2 = — 1 may be 
translated to mean that the operation i 9 when performed twice in succes¬ 
sion, is equivalent to the operation represented by multiplication by the 
number —1, which causes a reversal of direction. For if P(*, y) represents 
x -h iy> then the effect of multiplication by — 1 is to replace x + iy by 
— x — iy (see p. 106 ‘negative’), and this reverses the vector OP, rotating 
it through 180°. Since, therefore, i 2 causes a 180° rotation in the plane, it 
is a short step to interpret i as a 90° rotation, for then the equation i 2 = — 1 
states that two successive quarter-turns are equivalent to a half-turn. 
Whether the quarter-turn is clockwise or anti-clockwise is arbitrary; with 
the usual choice of Cartesian axes, it will be observed that, if U represents 
the ‘unit’ point, 1 + Oi (or the real number 1), then OU must be given 
an anti-clockwise quarter-turn to take up the position OI, where I repre¬ 
sents the number i, so that with the usual system of axes the sense of the 
90° rotation representing i is anti-clockwise. 

i y 



so again we see that multiplication of any complex number by i leads to a 
rotation of the vector through 90°. What is the effect of multiplication 

by — i ? 


Addition and subtraction in the Argand diagram 
If Zj = x 1 + iy x and z 2 = x 2 + iy 2 • 

Then z x + z 2 = {x 1 + x 2 ) + i(y i + yfj- 
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So complex numbers are added like displacements, or like position vectors, 
according to a parallelogram law. Indeed, the above equation might have 
been written as 

/*i\ / * 2 \ 1 + * 2 V 

\yi/ \y 2) \y 1 + y 2 / 

this means we can regard complex numbers in the Argand diagram to be 
represented as well by position vectors as by points. But it must be empha¬ 
sised that a complex number z corresponds not only to OZ but to the whole 
class of equivalent vectors which are equal and parallel to OZ (or, as we 
shall say in Chapter 14 , the additive group of complex numbers is iso¬ 
morphic with that of the two-dimensional vectors). 


Continued addition 



When several complex numbers are being added, there is no need for 
them to be combined laboriously in pairs using the parallelogram law 
repeatedly. Any number of vectors may be added by 4 nose-to-tail ’ addition, 
the order in which they are added being immaterial in view of the associa¬ 
tivity and commutativity of vector addition. Such an addition of complex 
numbers z x , z 2 , z 3 , z 4 is illustrated by the vector polygons overleaf. 
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The negative of a complex number 

If z = x + iy , then — z = — x — iy, so that — z is obtained geometrically 
from z by means of a half-turn about the origin. 



The subtraction of complex numbers 

When we say, in the field of real numbers, that 7 — 4 = 3, this is open to 
two interpretations: 

(; i ) 3 is the number which, when added to 4, makes 7: 3 4-4 = 7. 

More generally, a — b = c <=> b + c = a; 

(ii) 3 is the sum of 7 and —4: 7 + (—4) = 3. 

More generally, a — b = a + ( — b), or subtraction is equivalent to 
the addition of the negative , or additive inverse (see 14.1 ) . 

Similarly in the field of complex numbers, z x — z 2 has two interpreta¬ 
tions, each of which can be seen in the Argand diagram: 


(i) z x — z 2 is represented by that vector which, when added to z 2 , gives 
Zj, and (it) z x — z 2 is the sum of z x and ( — z 2 ), and when z x and ( — z 2 ) 
are added by the parallelogram law to give z x — z 2 it is immediately clear 
that the vector joining the origin to this point is equivalent to that joining 
z 2 to z x . 



10.2 THE ARGAND DIAGRAM 107 




The first of the two geometrical interpretations is more useful in practice, 
and we may say generally that, if complex numbers a, b, c,. . . are repre¬ 
sented by points A, B, C,. . ., then 

a — b is represented by BA 

b - a AB (note the ‘ reversal ’) 

q - P PQ. 

and so on. 



Multiplication by a real number 

It is evident in general that if two complex numbers z x , z 2 are so situated 
that the vectors OZ x and OZ 2 have the same direction , then one is a real 
multiple of the other. For example, if z t = 3 + 5i, and z 2 = 9 + 152, 
then OZ 2 — 3 OZ x , these vectors having the same direction. Similarly, if 
z 3 = -1^ - 2 ji then we have OZ 3 = -\OZ x : these two vectors have 
the same direction, but the minus sign informs us of their opposite senses. 
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Exercise 10.2a 

1 Plot the following points on the Argand diagram: 

(0 1 - i\ («) -2 + 3 i; (Hi) —2 - 3i; (tv) 4 - i; (v) -12 + 5i; 
(vi) cos + i sin ( vii) cos n + z sin n; {viii) —\yj3 + \i. 

2 If 2 = 1 + z, simplify z 2 , z 3 , z 4 , . . ., and mark these points on the 
Argand diagram. 

3 If Zj = 2 — z, z 2 = 1 + 3z, mark on the Argand diagram the points 

. Z 1 Z 2 2 

Z 1 > z 2 3 Z 1 "1 Z 2 3 Z 1 z 2 3 Z 2 Z 1 3 Z 1 Z 2 3 ’ » z i 3 CtC. 

Z 2 Z 1 

4 Let z = — 5 + 12z. Plot the following points on the Argand diagram: 

z, zz, z — 1, 3z, — 3z, -3 z 2 . 

z 

Mark also the points z', — z' and zz' where z' = —5 — 12z. 

5 Show that the points a , b, r, d (e C) form a parallelogram if and only 
if the sum of two of them is equal to the sum of the other two. 

6 Give a geometrical interpretation of a — b = +i(c — d) where 
a , b, c, de C . 

7 If A, B represent complex numbers a, b , obtain the points which 
represent: 

(i) |-(zz + 6); (zz) (zzz) zz + — zz); (zz;) b + £(<z — 4). 

8 ABCD is a rhombus with ^4C = 2BD. If b = 3 + z, d = 1 — 3z, find 
a and c. 

9 The centre of a square is the point — 2 + i, and one vertex is 1 + 3z. 
Find the others. 

10 ABCD is a square, with zz — 3 + z, b = 4 — 2i. Find possible posi¬ 
tions of C and D. 

11 ABCD is a square with a — — 2 4- 5z, c = 4 — 8 i. Find b and d. 

12 If a = 2 — i, b = 4 — 3z, find two values of r so that triangle ABC 
is equilateral. 


Division of a line in a given ratio 

If P is the mid-point of AB, it is also the mid-point of the other diagonal of 
the parallelogram formed b^ OA and OB as adjacent sides, so that 
p - i( a + *)• 
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a 


More generally, if P divides AB in the ratio P : a, then since AP and PB 
have the same direction, and their lengths are in the ratio /?: a, it follows that 


a AP = pPB 

so that ot(p ~ a) = p(b — p). 


Hence (a + p)p — oca + fib => 



a a + Bb 

p = T+T' 


Note that (a) when a = jS = 1, we get p = as the mid-point 

of AB; (£) when either a or jS is negative, the point P will lie outside the 
line segment AB, in the extension of either AB or BA; (c) we must exclude 
the case a = — (d) we choose to name the two parts of the ratio so that 

P and a are reckoned from A and B respectively. The reason for this is 
explained elsewhere (see ii.i). Briefly, P is the centre of mass of a at A 
and P at B respectively. 


Centroid of a triangle 

If G is the centroid of triangle ABC whose medians are AP, BQ, CR, we 
have p = ^(b + c). Now G divides each median in the ratio 1 :2 so that 
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PG : GA = 1:2 => g = 3P ~ t" = § * ~\r c) -\- — \{a + b + c). 

Equating real and imaginary parts would give us the coordinates of G as 

Ci(*i + * 2 + *s)> s(yi + y 2 + V s)] 3 

where A, B, and C are (x 1 , y x ), (x 2 , y 2 ), an< ^ (*35 # 3 )- 



Exercise 10.2b 

1 Find the centroid of the triangle ABC where a = —2 i, b — 4 + i, 
c — —10 — 5 i. 

2 ABCD is any quadrilateral. Prove by using complex numbers that the 
lines joining the mid-points of AB and CD, of AC and BD, and of AD and 
BC all bisect each other. 

3 ABCD is a parallelogram. P is the mid-point of BC, BD and AP inter¬ 
sect at F. Prove by using complex numbers that F trisects both BD and AP. 

4 Given any three complex numbers a, b , c, show that there exist real 

numbers a, /?, y such that aoc + bfl + cy = 0. Show further that A, B, 
and C are collinear when a + + y = 0. 

5 The triangle ABC is equilateral. Show that 
c = a + b) + jy/3(a — b)i 

and express c also in terms of CD where cd — cos §7T + i sin §7L Find also 
the complex number representing the centre of the equilateral triangle and 
show that 

a 2 + b 2 + c 2 — be — ca — ab = 0 

is a necessary and sufficient condition for the triangle ABC to be equi¬ 
lateral (which may be used to prove ‘Napoleon’s theorem’, see 10.2). 

6 Given points A, B, C representing complex numbers a, b, c, show how 
to construct the positions of: 

a + b, 3a + 2b, b — 2a, a — c, a + b + c, 
a + b — c, a + 2b — 3c. 
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Polar form , modulus , and argument 

We have already seen that the addition of complex numbers has an 
immediate interpretation on the Argand diagram in terms of the addition 
of vectors. When it comes to multiplication, ifz = z 2 z 2 , 

then z = z x z 2 = (x x + iy x ) (x 2 + iy 2 ) 

= (*i* 2 - y\yz) + i(xiy 2 + * 2 yi)> so that 
=> X = x t x 2 - y 1 y 2 \ 
y = x x y 2 + x 2 y x ) 

These formulae for the Cartesian coordinates of the point representing the 
product do not help us to locate it readily on the Argand diagram, and in 
order to provide a simple interpretation it is necessary to use polar co¬ 
ordinates. 



We have, for all non-zero values of z( = x + iy ), 
x = r cos 0, y — r sin 0 

where r (= the distance OZ) is called the modulus of z, written |z|, 

and 0 (— Z_XOZ) is called the argument (or amplitude in some older 

books), written arg z. 

Hence z = x + iy may now be expressed as z = r (cos 9 + i sin 9 ), which 
we shall usually abbreviate to z = r cis 0, 

where r = |z| = J(x 2 T z/ 2 ), and 0 = arg z = tan -1 -• 

x 

(It must be emphasised that, although when working with polar coordinates 
it is possible for r to be negative, here r is essentially positive.) 

So, for example, if z = 2 — 5i } 

then |z| = V(2 2 + ( —5) 2 ) = ^29 

arg z (in fourth quadrant) = tan -1 ( — 5/2) = 291° 48' 

and 2 - hi = ^29 (cos 291° 48' + i sin 291° 48') 

= 1/2 9 cis 291° 48'. 
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But there is clearly an infinity of possible values of arg z, such as 651° 48'., 
1 011° 48', —68° 12', —428° 12', etc. (all of which are equivalent modulo 
360°); so of these we usually select the principal value 0, such that 
— n < 8 ^ 7i, the principal value of the argument in the example above 
being -68° 12'. 

Similarly, with the complex number —2 i, 

\ — 2i\ = 2, and arg ( — 20 = —\n. 

So —2 i = 2(cos (— jn) + i sin (— j>n)) 

— 2 cis 

Summarising, 

z = x + iy = r (cos 0 + i sin 0) 

= r cis 8 

Rz = x = r cos 8 r = |z| = ^J(x 2 + y 2 ) (modulus) 

Iz = iy — ir sin 8 8 — arg z = tan -1 y/x (argument) 

(and for its principal value, — % < 8 ^ n) 


Example i 


Express in polar form: 

(0 — V 3 + («) 2 — 30 


(Hi) 


4 + 3 i 

12 i -~5 9 


(i - 2) 2 (3f + 1) 


In all these examples, and in the following exercises, it is a great help to 
plot the complex numbers on the Argand diagram, as this enables the 
modulus and argument readily to be ‘read off’. 

(0 “V 3 + * = 2(— i>/3 + io — 2(cos 150° + i sin 150°); 

(«) |2 - 3t| = V(2 2 + 3 2 ) = Vl3, 

arg(2 — 3 i) = tan -1 (—f) (4th quadrant) = —56° 19'. 
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So 2 - 3i = Vl3cis (-56° 19') 

4 + 3 i (4 + 3 i) (-5 - 12/) 


(Hi) 


16 - 63/ 


12/ — 5 ( — 5 + 12/) ( — 5 — 12/) 

65 /16 63 


169 \65 65 

65 . 
cis a; 


169 

(for 16 2 4- 63 2 = 65 2 ) 


169 

where a = tan -1 (—f§) in the fourth quadrant. 

(i - 2) 2 (3 i +1) (3 - 4z) (1 + 30 (15 + 50 (-3 - i) 


(iv) 


i - 3 


— 3 + z 

-40 - 30i 


10 


(-3 + 0 (-3 - 0 
= -4 - 3z = 5 cis 216° 52'. 


Example 2 

If z — cis 0 , express in polar form: 

i ("\ ^ z z+l 

(?) 1 - z; (it) - - (m) --• 

1—2 2—1 

(0 1 ~ cos 9 — i sin 0 = 2 sin 2 ^0 — 2? sin j0 cos ^0 

= 2 sin ^0(sin ^0 — i cos 
= 2 sin ^0 cis( —^71 + ^-0); 

2 cis 0 

1 — (cos 0 + i sin 0) 2 
2 cis 0 

1 — cos 2 0 — 2i sin 0 cos 0 + sin 2 0 
2 cis 0 

2 sin 0(sin 0 — i cos 0) 

(cos 0 + i sin 0) (sin 0 + i cos 0) 
sin 0(sin 0 — z cos 0) (sin 0 + e cos 0) 

z(cos 2 0 + sin 2 0) 

~— n, • ? n - b = i cosec 0; 

sin 0(sm 2 0 + cos 2 0) 
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..... z + 1 cis 0 + 1 cos 0 + i sin 0 + 1 

(in) -= - = - 

z—l cis 8 — 1 cos 9 + i sin 0—1 

_ 2 cos 2 \9 + 2 i sin \9 cos \8 

— 2 sin 2 \9 + 2z sin \9 cos \9 

cos ^0(cos j9 + i sin ^0) 

— sin ^0(sin \9 — i cos -|0) 

— cot -|0(cos ^0 + i sin ^0) (sin ^0 + i cos \8) 

sin 2 ^0 + cos 2 f0 

— — i cot |0. 



Note that the two questions above may be done more easily by using the 
device: 


cis 0 + 1 = cis |-0(cis ^9 + cis( — £0)) — cis ^0x2 cos \9 
= 2 cos \9 cis |0 

cis 0 — 1 = cis |-0(cis ^0 — cis( — j>9)) — cis f0 x 2 i sin |0 
= 2i sin ^0 cis ^0 = 2 sin ^0 cis(^0 + ^7t). 

We shall return to these again on pp. 122, 137. 


Inequalities 

Since z x + z 2 is represented by a side of the triangle, it is immediately 
clear that 


1*1 + Z z\ < \ Z l\ + i Z 2 


(and the equality would only occur if the triangle collapsed into a straight 
line). 
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More generally, we see that 
\z 1 + z 2 + • • • + z n \ < |z,| + \z 2 \ H-1- \ z „\, 



while the two further results 

\zi - z 2 1 ^ \ Zl \ - \z 2 \ and |z, - z 2 | ^ IzJ + |z 2 | 
also have simple geometrical counterparts. 

Exercise 10.2c 

1 Express the following complex numbers in the polar form 
r (cos 0 + i sin 9 ), 

giving the principal value of the argument, and illustrating on the Argand 
diagram: 

(i) —1; (ii) 1 — i\ (Hi) 1 + yj3i; ( iv ) —i; ( v ) —12 — 5i; 

(vi) —3 + 2 i; ( vii) sin a + i cos a; (viii) sin a — i cos a. 
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2 Express in the form x + iy : 


(i) 4 cis — ; (ii) 5cis7c; (iii) 10 cis J; ( iv ) 8cisl65°; 

In 

(v) 2 cis 800°; (w ) 6 cis — • 

4 

3 If a = 2 + i, b — 1 + 3z, mark in the Argand diagram the points 
representing: 

(i) ab\ (ii ) - ; (Hi) (iv) a 2 ; ( v ) b 2 ; ( vi ) a 2 + b 2 

b a 

and find the modulus and argument in each case. 

4 If a — cis|7C and b — 2 cis(—^7t), evaluate: 

(0 a 2 , a 3 ,.. (ii) b 2 , b 3 ,...-, (iii) ab, y 

Give the principal value of the arguments and mark each point in the 
Argand diagram. 

5 Simplify cis 6 x cis (j). Use the result to write down 

n 47i 

(i) cis — x cis—; (ii) (cis 80°) 3 . 

6 3 

6 Solve the equations, giving the roots in polar form: 

(i) z 2 — z -h 1=0; (ii) z 2 + z + 1 = 0; 

(iii) 9z 2 + 12z + 5 — 0; (iv) z 2 — z + 1 —7 = 0. 

7 Find quadratic equations whose roots are: 

(i) cis 120°, cis 240°; (ii) cis-, cis ——; 

5 5 

(iii) cos a + i sin a, cos a — i sin a. 

8 By using the identity (a 2 + b 2 )(c 2 4 - d 2 ) = (ac — bd) 2 + (be + ad) 2 , 
prove that \z x \ \z 2 \ — \z x z 2 1. 

9 Using only the definition \z\ 2 — x 2 + y 2 , prove that \z 1 + z 2 | ^ 

kl + hi 

10 If |Zj| = |z 2 |, prove that (z Y + z 2 )/(z 1 — z 2 ) is purely imaginary 
Interpret geometrically. 

11 Prove that |z, + z 2 | 2 + \z l - z 2 | 2 = 2|zj| 2 + 2|z 2 | 2 , and give a geo- 
metrical interpretation of this result. 

12 Show that the equation \z\ = 1 represents a circle of unit radius and 
centre O. Describe what the following equations represent: 

(i) \z - i\ = 2; (n) \z + l| = |z - 2t|; 
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(Hi) z + 3z| < 2; (iv) \z\ < 4 and \z — 2| > 1; 

(v) z - 3| + \z + 3| = 10. 

13 Illustrate the regions: 

(; i) | z — i\ > \z + i\; (ii) |z| > 3 > \z — 3|. 


14 Given that \z — 2 — i| < 5, show geometrically that 
8 < \z — 14 - 6i| < 18. 


15 


Prove that 


1 

1 

^ _ 

1 - z 

^ 1 - 1 

z| 


Multiplication of complex numbers 

We admitted earlier that the rule for multiplication, when expressed in 
Cartesian form x + iy , is cumbersome and does not lend itself to simple 
geometrical interpretation. When the polar form is used, however, a much 
simpler result emerges: 

For cis x cis 0 2 

= (cos 9 X + i sin 0 X ) (cos 0 2 + i sin 0 2 ) 

— (cos cos 0 2 — sin 6 l sin 0 2 ) 4- i (sin 9 1 cos 0 2 + cos 9 t sin 0 2 ) 
= cos (0! + 0 2 ) + i sin (0j + 0 2 ) 

— cis (0 X 4- 0 2 )> 


So 


cis 9 X cis 0 2 = cis(0 x + 0 2 ) 


Hence if Zj = r x cis 9 X and z 2 = r 2 cis 0 2 , 
it follows that 

z*z 2 — cis x r 2 cis 0 2 = r 2 r 2 cis(0! + 0 2 ). 

So, in multiplying complex numbers, we simply multiply their moduli 
and add their arguments: 

|z!Z 2 | = |zi| |z 2 | and arg(z!z 2 ) = arg z x + arg z 2 

Evidently the rule may be extended for the product of more than two 
complex numbers: 

ZjZ 2 z 3 = r x cis 9 j x r 2 cis 0 2 x r 3 cis 0 3 = r l r 2 cis(0j + 0 2 ) x r 3 cis 0 3 
= r 1 r 2 r 3 cis^j + 0 2 + 0 3 ); 

and more generally, z t z 2 . . .z„ = r l r 2 . . .r n cis(9 l + 0 2 + • • • + 0 W ). 

So kiZ 2 z 3 ...z„| = |zi||z 2 ||z 3 |...|z n | 

and arg(zjZ 2 z 3 ...z„) = arg z t + arg z 2 + arg z 3 H-b arg z„. 
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As a general rule, whenever one is concerned with the multiplication of' 
complex numbers, it is usually advisable to have them in modulus-argument 
form. If one wished to evaluate the very simple product (2 4- *)(1 + 3 i) 
(= — 1 + 7*), it would hardly be worthwhile changing into polars; but 
it is nevertheless instructive to do so as an illustration: 


Z, = 2 + ;■ = cis 26° 34'; z 2 = 1 + 3 i = JlO cis 71° 34' 
=> z,z 2 = ^50 cis(26° 34' + 71° 34') = ^50 cis 98° 8 ' 

= 5^/2 cis 98° 8 ', 

which will be found to agree with — 1 T 7 i. However, if one wished to 
evaluate (1 4 - 3i) 7 , it would be well worthwhile using the polar form to 
avoid the work with the binomial theorem. Comparing the two methods: 

(a) (1 + 3 i) 7 = 1 + 7 C 1 3i + J C 2 (3i) 2 4- 7 C 3 (3z ’) 3 + 7 C 4 (3i) 4 

+ 7 C 5 (3i) 5 + 7 C 6 (3i) 6 + (3i) 7 
= 1 4- 21* + 189z 2 4- 945 i 3 + 2 835*' 4 4- 5 103z 5 
+ 5 103z 6 4 - 2 18 7z 7 

= 1 + 21z - 189 - 945* + 2 835 4- 5 103* - 5 103 
— 2 187* (putting * 2 = — 1) 

= -2 456 4- 1 992*; 

(b) (1 + 3*) 7 = (VlO cis 71° 34 ') 7 = (VlO ) 7 cis(71° 34' 4- 71° 34' 

+ • • • 4- 71° 34') (adding the argument 7 times) 

= 1 OOOy/lO cis 500° 58' = 1 OOOj 10 cis 140° 58' 
w 3 162(-0.776 7 4 - 0.629 li) = -2 457 4- 1 991*. 

The second answer is less accurate, due to the use of 4-figure tables; but 
its advantages are plain, and would of course increase if the exponent, 7, 
had been larger. 


The unit circle 

An important special case is that of numbers of unit modulus (i.e., of the 
form z = 1 cis 6) lying on the unit circle, |z| = 1. These are multiplied 
by simply adding their arguments, i.e., by performing a rotation about 
the origin on the Argand diagram. This will be clear since 

ZjZ 2 = cis cis d 2 — cis( 0 j + d 2 ), 

so that in an Argand diagram z x z 2 may be obtained either by rotating 
about O from position z x through 0 2 ? or from z 2 through 6 t (positive anti¬ 
clockwise). In particular, multiplication of any complex number by 
*(= cis 90°) is represented by a positive rotation through a right angle. 

This leads us to consider the general interpretation of multiplication on 
the Argand diagram. Suppose we consider again the product z x z 2 where 
z x = 2 4- *, z 2 = 1 4- 3*. Then z x z 2 = ^/5 cis a x ^/lO cis /?, where 
a = 26° 34', /? = 71° 34'. If we start with z x , we may multiply by z 2 in 
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two stages: first by ^/lO, then by cis /?. Multiplication by ^10 clearly 
corresponds to an enlargement from the origin by a factor yj 10 and multi¬ 
plication by cis /3 causes the argument to be increased by ft to the value 
a + /?, i.e., corresponds to a rotation through ft, which brings us to the 
final position z ly J 10 cis/? = z x z 2 . More generally, multiplication by 
r cis 6 corresponds to enlargement from the origin by a factor r , coupled 
with a rotation, through 9 about O. Evidently the two transformations — 
enlargement and rotation, both about O — commute, and the composite 
transformation is known as a spiral similarity. 



We now see how a geometrical construction may be used to determine 
the position of the point Z representing the product z x z 2 when the points 
Zi,Z 2 representing z x , z 2 are given. Letting U be the unit point, we have 
OU = 1, OZ x = r l9 OZ 2 = r 2 , OZ = r x r 2 so that OZ/OZ 2 = OZ x /OU. 
Since each of Z_s UOZj, Z 2 OZ is equal to 6 X , it follows that As UOZ l , 
Z 2 OZ are similar, so that the position of Z may be constructed by making 
AZ 2 OZ = ZUOZj and LOZ 2 Z x = LOJJZ x . 
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Reciprocal of a complex number 

We have already seen that the reciprocal of the complex number x + iy 
is (x — iy)/{x 2 + y 2 ), the result being obtained by the use of the con¬ 
jugate x — iy . However, it is somewhat easier to consider the reciprocal 
of a complex number when it is expressed in polar form. 



Note that cis 9 and cis (- 9) are reciprocals, i.e., that 


(cos 9 + i sin 9) (cos 9 — i sin 9) = 1. 

To plot the point on the Argand diagram representing the reciprocal of a 
given complex number z, we simply make Z_UOZ ; = 0, measured clock¬ 
wise [since arg (1/z) = -0], and OZ' = 1/r, where U is the unit point. 
Since OZ/OU = OU/OZ', and AUOZ = AZOU, triangles ZUO, 
UOZ' are similar; so to perform the construction it is only necessary to 
make LUOZ' = A UOZ and AOUZ' = AOZU. Of course, Z repre¬ 
sents the reciprocal of the complex number represented by Z': reciprocals 
are interchangeable. 





Division of complex numbers 
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Zy . 1 

If we require —, this may be written z x x —, and we can now use our 
Z 2 z 2 
knowledge of reciprocals: 

— = cis Oy x — cis( — 0 2 ) = ( r i x 1 cis[ 0 x + ( — 0 2 )1 

z 2 r 2 V r 2 / 

— — cis (Oy — 0 2 ). 
r 2 

So the rule for division is to divide the moduli, and subtract the arguments — 
the reverse of the rule for multiplication. Or we may write: 

Zi |zi| Z 1 

— = -I—r; arg — = arg z, - arg z 2 . 

Z 2 \ Z 2\ Z 2 

This is what one would expect since division is the inverse process of 
multiplication. A useful exercise would be to evolve a construction for 

. . . Zi 

arriving at the point representing — starting from given positions of Zy , z 2 . 

Z 2 

This will in effect be a combination of the processes in the previous sections. 


Example 3 


Evaluate 


(*--2) 2 (3/+ 1) 
i - 3 

i — 2 = -2 + i = 


using polar form (compare p. 113) 
V5 cis 153° 26' 


=> (»' - 2 ) 2 = 5 cis 306° 52'. 

Also, 3i + 1 = 710 cis 71° 34'. 

So (i - 2) 2 (3i + 1) = 5^10 cis 378° 26'. 

Finally, i - 3 = -3 + i = ^/lO cis 161° 34' 

(* - 2) 2 (3i + 1) _ 5^/10 cis 378° 26' 
i - 3 ~ TlOcis 161° 34' 

= 5 cis 216° 52' = -4 - 3 i. 
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Example 4 

Simplify ‘ - cis ’4 an d 
1 — cis 0 

1 — cis nd 1 — cos nd — 

1 = 


1 — cis n6 


1 — cis 6 
i sin n6 2 sin 2 \nd 


2 i sin bnd cos bnd 


cis 6 1 — cos 6 — i sin 0 2 sin 2 \\6 

sin \nd (sin \nd — i cos ^nd) 


2 i sin A 0 cos bd 


sin \d (sin j0 — i cos ^0) 
sin ^nd sin ^0 x —i (cos -g -«0 + i sin ^ 0 ) 
sin ^0 sin x —i (cos ^0 H- i sin j0) 
sin iw 0 


2 

cis ^ 


sin £w 0 
cis bO sin 40 


It follows that 


CIS 


2 

nO ^ 3 


1 — cis d 


sin \r 


sin 40 


cis bin — 1 ) 0 . 


cis § (n — 1 ) 0 , 


the argument being trebled when the number is cubed. 


Alternatively, 

1 — cis nd cis \nd [cis \nd — cis(— \n0)~\ 

1 — cis 0 cis \d [cis \d — cis (— 10 )] 

cis bnd x 2 i sin bnd sin bnd 

= - —1 -rr- = - ~T7T x CIS bin — 1)0. 

cis jrd x 2 i sin \0 sin ^0 

{Writing 1 ± cis 0 in the form cis |0[cis (— £0) ± cis £0] is often a very 
useful device.} 


* Example 5 

Show that if triangle ABC is equilateral with the sense of rotation from 
A to B to C being anti-clockwise, then a + ba) + cco 2 — 0 , where a> = cis §71 
and conversely. 
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Since AABC is equilateral, CA is obtained from BC by a pure rotation 
through 120° = %n about C. 

So a — c = (c — b) cis -§71, 

=> <3 — c = co(c — b), with the notation above, 

=s> a 4- cob — (1 + c o)c = 0. 

Now co 3 = 1 => (cu - l)(co 2 + CO + 1) = 0. 

But co # 1, so co 2 + co + 1 = 0, 

(which may also be deduced from the fact that the three points 1, co, co 2 
form an equilateral triangle about the origin). Replacing (1 + co) by 
— co 2 gives immediately the required result, that a + cob + co 2 c = 0. 
Conversely, if it is known that a + cob + co 2 c = 0, then we may write 

a + cob + ( — 1 — (jl>)c = 0 => a — c = co(c — b), 

=> a — c = (c — b) cis §71. 

Hence CA is obtained from BC by a positive 120° rotation, and AABC is 
equilateral, in an anti-clockwise sense. The reader may also verify that the 
condition a + a> 2 b + cor = 0 is necessary and sufficient for the triangle 
ABC to be equilateral in the opposite or clockwise sense. 

* Example 6 

Prove Napoleon’s theorem, that if equilateral As BCP, CAQ, ABR are 
drawn on the sides of any triangle ABC, with P, Q, R outside the triangle, 
then the centres of these three triangles form another equilateral triangle. 

R fs 



p 


If APCB is equilateral with anti-clockwise sense, 
p + coc + co 2 b = 0. 
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q + coa + co 2 c = 0, 
r + ojb + co 2 a = 0. 

But if L, M, N are the centres of As PCB, QAC, RBA, respectively, then 
31 — p + c + b, 

3m = q + a + c, 

3 n = r + b + a. 

So 3(1 + com + co 2 n) 

= (p + c + b) + co(q + a -f c) + co 2 (r + b + a) 

= (p + (DC + co 2 b) + co(q + coa + co 2 c) + co 2 (r + cob + co 2 a) 

= 0. 

Hence ALMN is equilateral. 


Exercise io.2d 

i Simplify: 

(0 4 cis 150° x 5 cis 80°; 

cos 6 — i sin 0 

(») 5 — t . . t ; 

cos cp + z sin cp 

(Hi) cis 0 x cis( — 0 ); 

.. , / 3tt . . 3tt \ 2 

(m>) I cos — + z sin — I ; 

(i>) (£ — 1 ) (1 — by first expressing each in the form r cis 0 ; 

( "> (“is) 1 

(wt) (1 + cos a + 2 sin a) (1 + cos a — i sin a); 

(viii) (1 — cos ft + z sin /?) 2 ; 

(i*) cis ( — 0 ) -r cis 20 ; 

(*) cos 0 + i sin 0 4 - - 

cos 0 — i sin 0 

(arc) cis a x cis ft -r- (cis y) 2 ; 

1 — cos 0 + z sin 0 


(*m ) 


1 + cos 0 + i sin 0 


2 Prove that . .z„| = |z t | |z 2 | |z 3 |.. .|z„| 

and that arg (zjz 2 z 3 .. .z„) 

= arg z, + arg z 2 + arg z 3 H-+ arg z„. 
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3 If z = cis 0, express in polar form: 

z — 1 Z Z f I 

(0 1 + 2 ; («) -^- q — j -; ( m ) ^ 2 —(«0 y 

4 Given the position of a point z on the Argand diagram, show how to 
construct the positions of: 

3z, — 2 z, |z|, zz, — zz, — 2 iz, z 2 , zz 2 , z 2 + 2z, (z — l) 2 . 

5 If c = (1 — k)a + kb , prove that when A; is real, C lies on AB and 
divides AB in the ratio k : (1 — k); while when k is complex, the triangle 
ABC has AB: AC = 1 : |Ar|, and the angle of turn from AB to AC = arg k. 

6 Show that the triangles ABC and A'B'C' are directly similar if and 
only if 

a — c a! — c 

b - c = V - d 

and that this may be expressed as £ a ' iP ~~ c ) = 0 ; or as 
1 1 1 

a b c = 0 (see 13 . 5 ). 

a b ' c 

7 Use the result in the first part of no. 6 above to deduce the condition 

for triangle ABC to be equilateral, in the form: 

a 2 + b 2 + c 2 — be — ca — ab — 0 (compare Exercise 10 . 2 b, no. 5 ). 

Furthermore, show that this is equivalent to 

{a 4 - cob + co 2 c) (a -b co 2 b + c&c) = 0 (compare Example 5). 

8 Given a triangle ABC, points L, M, N are chosen so that A’s BCL, 
CAM, ABN are all directly similar. Prove that AABC and ALMN have 
the same centroid (use the first part of no. 6 ). 

10.3 Complex conjugates 

We have already noted that the roots of the quadratic equation 
x 2 + 6x + 13 = 0 are — 3 + 4z, and that these complex numbers appear 
on the Argand diagram as mirror images in the real axis. 

Complex numbers having this property are called conjugate . 

More generally, if 

z = x 4* iy = r cis 0, 

we define the complex conjugate of z, denoted z*, as 
2 * = x — iy = r cis ( — 0 ), 

and on the Argand diagram z and z* will be reflections in the real axis. 
We might also write (x 4 - iy )* = x — iy. 
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Note that: 


(a) When y = 0 (i.e., the complex number is real) then z is its own con¬ 
jugate : real numbers are self-conjugate, 

(b) When |z| = 1, z — cis 0, z* = cis ( — 0), and in this case conjugates 
are reciprocals. 

(c) Both the sum and the product of a pair of conjugates are real numbers: 

z + z* = (x + iy) + (* — iy) = 2x = 2 R(z) 
zz* = (x + iy)(x — iy) = x 2 + y 2 = |z| 2 ; 

however, by contrast, 


z — z* — 2 zz/ — 2 /(z). 


The reader can quickly verify that for any complex number z, 

(z*)* = z and (— 2 )* = — z*; 
also (zj + z 2 )* = zf ± zf and (ZjZ 2 )* = zfzf. 

The result zz* = |z| 2 has already proved useful when simplifying expres¬ 
sions, by applying it to make the denominator of a quotient real. For 
example, 

1 + Si _ (1 + 30(2 - i) _ 5 + 5i . 

2 + z “ (2 + 0(2 - 0 ” ^ + 


As we observed at the beginning of this section, the roots of the quad¬ 
ratic equation z 2 + 6 z + 13 = 0 are a pair of conjugate complex num¬ 
bers. Clearly this is bound to happen when solving any quadratic equation 

az 2 + bz + c = 0 , if < 2 , b, c are real and b 2 < 4 ac. 


For 


~b ± yj(b 2 - 4ac) 
2 a 


b V(4 ac - b 2 ) . 

— ± ---- l. 

2a 2 a 


which is a conjugate pair provided that the coefficients a , b, c are real. (To 
show the necessity for this last condition, we can consider the equation 


z 2 + z(-l + i) + (2 + i )=0 
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=> (z — i) (z — 1 + 2i) = 0 
=> z = i or 1 — 2 i, 

which are certainly not conjugate.) 

Again, the equation 

z 3 — z 2 — z — 15 = 0 (with real coefficients) has roots z = 3, — 1 + 2£, 
— 1 — 2 i, i.e., one real root and a pair of conjugate complex roots. 
Furthermore, the equation 

z 5 — 2 z 4 4 - 4z 3 — 2 z 2 + 8 = 0 (with real coefficients) 
may be written as 

(z + l)(z 2 + 4) (z 2 - 2z + 2) = 0 

and so has roots — 1 , + 2 £, 1 + z, and again the complex roots occur in 
conjugate pairs. 

We now prove that, for any polynomial equation with real coefficients, 
the complex roots occur in conjugate pairs. 

Let the polynomial be 

/(z) = a n z n + a n _ 1 z n ~ l + • • • + a x z + a 0 

and suppose that z is a root, so that f (z) = 0 . We therefore have to prove 
that z* is also a root. 

Now /(z*) = a n (z*) n + +,-i(z *)” -1 4 -h a x z* + a 0 

= < 2 *(z*)" + + * • • + tffz* + a% 

(since all the a n are real and therefore self conjugate). 

But fl*(z*) w = fl*(z”)* = (a n z n )*, etc. 

Hence /(z*) = {a n z n )* + (^.jz" -1 )* H-h («iz)* + a% 

= ( a n z n + a n _ x z n 1 + • • • + a x z + fl 0 )* 

= {/(*)}* 

= 0 * 

= 0 . 

So whenever z is a root, so is z*; and we see that the roots of such equa¬ 
tions with real coefficients always occur in conjugate pairs (or as real 
numbers, which are self-conjugate). 


Example 

Solve the equation 

z 3 - 3z 2 - 19z + 13 = 0, 

given that one root is 2 + 3 i. 

As the coefficients are real, another root is 2 — 3 i. So if the third root is a, 
the sum of the roots of the equation gives 
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a + (2 + 3 i) + (2 - 3z’) = -(-3) = 3 
=> a = — 1 . 

Hence the roots are — 1, 2 ± 3z, each of which may be checked in the 
original equation. 


Exercise 10.3 

1 Prove that: 


(*) i z i + z 2 )* = 4 + 4; 

(n) (Zj - z 2 )* = zf - zf; 
(m) (z,z 2 )* = zfzf; 



2 Extend the above results to: 

(0 (Z z r)* = X 4; («) (]l Z r)* = F[4 t 


3 Prove that zfz 2 = (Zjzf)*, and verify by putting: 
Zj = cis ; z 2 — r 2 cis 0 2 . 


4 If z = r cis 6, show that 
r = J (zz*), d = % (arg z - arg z*) 
and also that z 2 + (z *) 2 = 2 |z | 2 cos 26. 

Prove that arg z — 6 => z = z* cis 26. 

Is the converse true? 


5 Use conjugates instead of moduli and arguments to express: 

(0 \ z - !| = 3|z|; (ii) arg(z - 1 ) - arg(z + 1 ) = §j z. 

6 If z x and z 2 are complex numbers such that z x + z 2 and z x z 2 are both 
real, prove that z\ = z 2 . 

7 Prove that the area of the triangle OAB is — (a*b — ab*), and that 

4 1 

the area of the triangle ABC is 

1 1 1 

a b c (see also 13 . 5 ). 

a * b* c* 

8 Given that 1 — i is a root of 
2z 3 - 7z 2 + lOz - 6 = 0, 
find the other roots. 


1 

4 i 


t n denoting product , just as £ denotes sum. 
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9 If 2 + i is a root of 

2 z 3 - 9 z 2 4 - 14 z - 5 = 0, 
find the other roots. 

10 Given that z + n (n e Z + ) is a factor of z 3 4- 6z 2 -b 16 z + 16 , find 
n and then factorise completely. 

11 Solve the equation 

z 4 - 3 z 3 + 4 z 2 - 3 z + 1 = 0 , 

given that one root is -^(1 — ^/ 3 i). Solve also by making the substitution 
w = z + 1/z. 

12 Solve the equation 
z 4 - 6z 2 + 25 = 0. 

13 Solve z 4 — 6z 3 + 23 z 2 — 34 z + 26 = 0 , given that one root is 

1 + i. 

14 Show that the equation of the circle 
(* ~ a) 2 4 - (y - P ) 2 = p 2 

can be written as 

zz* — p*z — pz* + pp* — p 2 = 0, 
where/) = a + z’j8. 

15 If O is the orthocentre of the triangle formed by a , b , c, prove that: 
be * + b*c = ca* + c*a = ab* 4- a*b. 

16 Prove that the reflection of A in the line BG is the point A' given by 

, a*(b — c) 4- b*c — be * 

a =- 

_ £* 

17 In Exercise 10.2a no. 6, we obtained the condition for two triangles 
to be directly similar. What is the condition for opposite similarity? 


10.4 de Moivre’s theorem*)* 

Having established means of obtaining the sum, difference, product, and 
quotient of two complex numbers and their interpretation on the Argand 
diagram, we now investigate their squares, cubes, square roots, cube roots, 
and other powers. Initially we shall restrict ourselves to the rational powers 
of complex numbers of unit modulus, i.e., to (cis 0)”, where n€ Q. 

Case 1 ne Z + 

We know that (cis 0 ) 2 = cis 0 x cis 0 = cis (0 4 - 0) = cis 20 

t After Abraham de Moivre (1667-1754), born in France but resident in England for most 
of his life. 
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(cis 0) 3 = cis 0 x (cis 0) 2 = cis 0 x cis 20 
= cis 36, and so on. 

More generally, V ?2 € Z + , (cis 0)" = cis n6, which can be proved form¬ 
ally by induction. 


Case 2 n e Z 

Since cis 6 x cis ( — 6) = 1, 

(cis 0) _1 = cis ( — 6). 


Again, 


(cis 6) 2 = 


1 


(cis 6) 2 cis 26 


(cis 6) 3 = 


(cis 6) 3 cis 30 


= cis (— 20) 


= cis (— 30), etc. 


So, more generally, (cis 0)” = cis n6, where n — — 1, — 2, — 3,.. . (i.e., 
VneZ"). 


J 72 — 0 

(cis 0)° = 1 — cis 0. 

So again, (cis 0)" = cis nd when 72 = 0. 


Case 4 n = p/q, a rational fraction, with p, q e Z 

using the result above, 

* <7 I 

q 

= cis/70 = (cis 0) p . 


As # e Z 


cis-oV = cis (^— X 0 


p and q being integers 


So cis -0 is a ^th root of (cis 6) p , i.e., a value of (cis 6) p!q . 

q 

Summarising these results, we have de Moivre’s theorem, that 

if 72 e Z, (cis 0)" = cis 720; 

and if n e Q, cis 720 is one of the values of (cis 6) n 


It is necessary to say c one of the values of’, for we shall see later that 
(cis 6) plq has in fact q distinct values. At the moment, we note that, since 

(cis \ri) 2 — cis and (cis f n) 2 = cis fn = cis \n , 

then, (cis \n ) 1/2 = either cis or cis f7i, 

there being exactly two square roots of a complex number, which (not 
unexpectedly) are a pair of negatives. The fact that there are three cube 
roots may be more surprising. However, it is easily verified that cis 120° 
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and cis 240° are both cube roots of 1. Moreover, you may verify that 
2 — i y — 2 4 i, 1 4- 2 i, and — 1 —2 i are all fourth roots of —7 — 24 L 
De Moivre’s theorem tells us that an effect of raising a complex number 
to the power n is to multiply its argument by n. This is a result of far- 
reaching importance, leading to powerful and elegant solutions of many 
problems, and we now proceed to consider some of its applications. 


Example i 

Use de Moivre’s theorem to find formulae for cos 40 and sin 40 in terms of 
cos 0 and sin 0. 

(Here, and in subsequent examples, we shall use the abbreviation c = cos 0, 
s = sin 0). 

cos 4 6 4 i sin 40 = (c 4 is) 4 = c 4 4 4 c 3 is 4 6c 2 i 2 s 2 4 4 ci 3 s 3 4 i 4 s 4 
= ( c 4 — 6 c 2 s 2 4 s 4 ) 4 i(ic 3 s — 4 cs 3 ). 

Equating real and imaginary parts, 

cos 4 9 = c 4 — 6 c 2 s 2 4 s 4 = c 4 — 6c 2 (l — c 2 ) 4(1— c 2 ) 2 
= 8r 4 - 8r 2 4 1 

sin 4 9 = 4 c 3 s — 4 cs 3 = 4 cs(c 2 — s 2 ). 


The method may be generalised to obtain cos n9 and sin n9 in terms of 
c and starting from (c 4 is) n = cos n9 4 i sin nO. Using the binomial 
theorem and equating real and imaginary parts, we get 

cos n9 — c n — n C 2 c n ~ 2 s 2 4 n C 4 c n ~ 4 s 4 4 • • ■ 

sin n9 = nc n ~ l s - n C 3 c n ~ 3 s 3 4 n C 5 c n ~ 5 s 5 - 

It follows that 


sin n9 nc *s — C 3 c 3 5 3 4 • • • 

tan n9 = ——- =--- 9 - 

cis n9 c — C 2 c 2 s 2 4 • * • 

nt - n C 3 t 3 4 n C 5 t 5 - 

" 1 - n C 2 t 2 4 n C 4 t 4 -’ 

(dividing numerator and denominator by c n , and using t = tan 9). 


In particular, for example, 


tan 50 = 


5t - lO* 3 4 t 5 m 

1 - 10 1 2 4 5 1 4 ’ 


tan 40 = 


4 1 - 4 1 3 

1 _ 6t 2 ~Tt 4 ' 


etc. 


Example 2 

Use de Moivre’s theorem to investigate the cube roots of i. 
i = cis ; %ji = i 1J3 = (cis^7r) 1/3 , 
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so one of the values is 


cis £n = cos §71 + i sin £n = 1(^/3 + i). 

However, we may also regard i as having argument 5n/2, 9n/2,. . and 
when de Moivre’s theorem is applied we obtain i 1/3 = (cis 57 t/2) 1/3 = 
cis 5n/6, and again i 1/3 = (cis 97r/2) 1/3 = cis 3n/2. The addition of further 
revolutions to the argument of i cannot produce any different values of 
z 1/3 ; for example, (cis 137r/2) 1/3 = cis \3n/6 = cis 7 r/ 6 , which has already 
been obtained. Again, if we regard i as having argument — 3n/2, this 
gives [cis (- 3 tt/2 )] 1/3 = cis (- n/2) — cis 3n/2, also already obtained. Thus 
the three cube roots of i will be seen to have arguments separated by 
multiples of 2n/3, and so appear on the Argand diagram as the vertices of 
an equilateral triangle, at the points F 1 , P 2 , P 3 . 



Example 3 

Find the values of ( — 1 + i) 3/5 . 

Now — 1 + i = ^2 cis 135°. 

So, by de Moivre’s theorem, one of the values of (— 1 -f i) 3j5 is 
(y/2 cis 135°) s/5 = 2 0 - 3 cis 81°. 

However, any number of revolutions may be added to the argument of 
the original number, which may therefore be written yj2 cis (135° + 360° n ), 
where n is any integer. 

Thus (-1 + i) 3/5 = {y/2 cis (135° + 360° n)} 3/5 
= 2 0 ' 3 cis (81° 4 - 216° n), 

i.e. 2 0 - 3 cis 81°, 2 °‘ 3 cis 297°, 2 0 3 cis 513°, 2 0 - 3 cis 729°, 2 0 ' 3 cis 945°, etc; 
so, reducing the arguments modulo 360°, we see that 

2 0 ’ 3 cis 81°, 2 °‘ 3 cis 297°, 2 0 3 cis 153°, 2 0 ’ 3 cis 9°, 2 0 * 3 cis 225° 
are five distinct values. 
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These are shown in the figure, where the arrows show the route followed 
by adding 216° repeatedly to the argument. It is also clear that they form a 
regular pentagon, and that precisely five values exist. 



At this stage it is interesting to reflect on what has been achieved. When i 
was introduced to represent y] — 1 , it might have been expec ted that 
further inventions would be required in order to represent yj i, — 1 and 
y(l — i), and that a whole range of ‘hyper-complex’ numbers would 
have to be developed. But de Moivre’s theorem immediately shows that 
this will not be necessary, and that the simple invention of i as y/ — 1 is 
entirely sufficient for dealing with all such complications. 


Exercise 10.4a 


1 Express in the form r cis 0 where the argument has its principal value: 


(0 (cis 72°) 14 ; 
(Hi) (28 + 9 6i) 4 ; 
(v) (V3 - 0 7/4 ; 

{vii) {J3 - i ) 20 ; 

(ix) V(! ~ +3); 

(, xi) (i + tan 6 ) n ; 


(: ii) (cis 100 °) " 5 ; 

(; iv) (cos 80° — i sin 80°) 3,5 ; 

(vi) (1 + 0 10 (two methods); 

(viii) 4/(1 + »); 

(x) (sin 9 — i cos 9) 5 ; 

(cos 5a + i sin 5a) 6 (cos 3a — i sin 3a) 7 
(cos a + i sin a) 4 (cos 2a — i sin 2a)' 2 


2 Find yji: ( i) by letting its value be x + iy; (ii) by expressing in polar 
form. Find also values of ^Ji and of ^(— i). 


3 Obtain sin 30 in terms of sin 0, and cos 30 in terms of cos 0 by using 
de Moivre’s theorem. Hence show that if sin 30 + cos 30 = 0 , then either 
tan 0 = 1 or sin 20 = —j,. 


4 Find the principal value of the arguments of (cis 50°) 5 , (cis 122°) 5 , 
and [cis (— I 66 0 )] 5 . Can you find two other complex numbers which give 
the same result? 
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5 Find as many as possible values of (1 + i) 1/6 . 

6 (i) Solve z 3 — 1 = 0 , expressing the three roots in polar form. 

(ii) By writing — 1 as cis n, obtain the roots of z 3 + 1 = 0. 

(Hi) Find all the values of (cis 150°) 1/3 . 

7 Find formulae for: (i) cos 60 in terms of cos 0; (ii) sin 70/sin 0 in terms 
of cos 0; (Hi) tan 50 in terms of tan 0; (iv) sin 60/sin 0 in terms of cos 0. 

8 (i) If z + z _1 = 2 cos 0, show that z" + z~ n = 2 cos nO , and find 
z n - z“ M . 

(ii) If z + z“ 1 = — 1 , find the values of z n + z~ n for n = 2 , 3 , 4, ... . 

9 If cos a + cos P -f cos y = 0 and sin a + sin p + sin y = 0, prove 
that a, (i and'y are separated by § 7 r. Generalise in the case of the equations 

n n 

Y cos a r = 0 , Y sm a r = 0 * 

1 1 

10 Prove that (1 + cos 6 — i sin 0)" = 2 n cos n \0 (cos — i sin \n6). 
Hence write down the value of (1 4 - cos 0 + i sin 0) n and simplify the sum 
of these two expressions. 

11 Using the expansion of tan 50 from no. 7 above, show that the roots of 

t 5 + 5 / 4 — lOf 3 — 10 ^ 2 + 5£ + 1 = 0 are tan (4 n — 1) — (ne Z), 

\ y 20 y 

and obtain five distinct roots. 

Use the expansion of tan 60 to evaluate tan7i/12. 

12 Use the expansion of tan 50 in terms of t (= tan 0) obtained in no. 7 
to solve the equation 

t 5 - 10 1 4 - 10 1 3 + 20 * 2 + 5t - 2 = 0. 

13 Solve the equation sin 50 = sin 5 0. 

14 Using the expansion of cos 50, deduce that sin 18° — i(>/5 — 1). 

15 Prove that 

sin 70 = lc 6 s — 35 ^ 4 ^ 3 + 2U 2 / — s 7 where c = cos 6, s = sin 0. 

Hence solve the equation 
lx 3 - 35 * 2 + 21* - 1 = 0 
giving three distinct roots. 


The nth roots of a complex number 

The above examples illustrate how, in general, the nth roots of a complex 
number may be found. Writing the number in the form z — r cis 0, we 
have (r cis 0) 1/n = r lfn cis (9/n) as one of the roots. To obtain the other 
roots, we write z = r cis (0 + 2nk) (keZ). Then z 1/n = r l/n cis (0 + 2nk)/n. 
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It is evident that n different values are obtained by k taking the values 
0, 1,2,3 ,...,/? — 1, and that this exhausts the distinct values: when k 
takes the value n (i.e. when 2nn is added to the argument of z) the argument 
of z 1/n will have been augmented by 2n, so repeating the value corres¬ 
ponding to k = 0. So the nth roots lie in the Argand diagram on the circle 
of radius r l/n separated by a consecutive angular distance of 2n/n, and form 
a regular n-gon. 

In particular, the nth roots of unity are given by 
2jik 

(cis 2nk) 1/n = cis-•> (k = 0, 1, 2, . . ., n — 1), 

n 

. 2n 

and these form a regular rc-gon in the Argand diagram. If we let 00 = cis —? 

then the n roots may be denoted 1, co, co 2 , co 3 ,. . ., of ~*, with of = 1. It is 
clear that 1 + co + co 2 +m 3 + • —hco" -1 = 0; and furthermore, since 
1, co, co 2 , co 3 ,..., of ~ 1 are the roots of z" — 1 = 0, 

z" — 1 = (z — 1) (z — co) (z — co 2 ) (z — co 3 )... (z — 0 ) n ~ 1 ). 



The particular case n = 3 is important, and we have 1, cis f n and 
cis (— § 7 r) as the cube roots of unity. It has already been seen how the cube 
roots of unity can be applied to the solution of a geometrical problem such 
as Napoleon’s theorem, and in Chapter 13 we shall also see how they may 
be used to evaluate a determinant. Meanwhile we provide some simple 
examples on the nth roots of a number: 


Example 4 

Solve the equation z 6 + 1 = 0 
z 6 = — 1 = cis( 27 r k + n). 


So, by de Moivre’s theorem, z = [cis (2 nk + 7 r )] 1/6 = 


. 7i(2k + 1 ) 
cis- 


6 
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Taking k = 0, 1, 2, 3, 4, 5 we obtain z = cis 7i/6, cis tc/ 2, cis 57r/6, cis 77i/6, 
cis 37 t/ 2, cis 1171/6 


k u 



.71 .571 

=> z = + 1 , + cis— > or + cis — • 

” “ 6 “ 6 

Example 5 

Solve the equation (z — l) 5 + (z+l) 5 = 0 
5 

= — 1 — cis (2k + l)7i. 

So -- = cis -——— n (k = 0, 1, 2, 3, 4). 

z + 1 5 v ' 




2k + 1 

Writing a = --- n, 

5 


z 


1 + cis a 
1 — cis a 
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*37 


cis [cis + cis (—^a)] 

— cis [cis |-a — cis ( 

2 cos 2 a • i • (2£ + 1 )tc 

= l cot \0L = 2 cot --— 


- 2i sin ia 


10 


So the roots are (putting k = 0 , 1 , 2, 3, 4): 


71 

371 


5n 

77T 

9n 

= 2 COt-? 

i cot —» 

i cot 

—9 1 cot 

-9 

1 cot T7T ; 

10 

10 

10 

10 

10 

7C 

3tc 

0 , 

371 


7t 

= 2 COt-» 

2 COt- 9 

— 2 COt —9 

— 

1 c °t — ; 

10 

10 


10 


10 


7t 371 

2 = 0, +2 COt—’ ±2 COt— * 


(We expect an equation of degree 5 to have 5 roots. Note that there is one 
real root, together with two pairs of conjugate complex roots.) 


* Example 6 

Factorise x 2n — 2x n cos not + 1 . 

If x 2n — 2x n cos m + 1 = 0, 

then (* M — cos m ) 2 + sin 2 22 a = 0 

giving = cos noc ± i sin n<x = cis[± (non + 2 kn)~\ 



For k — 0, 1, 2,. .., n — 1, we obtain 2n roots, and these also occur in 
conjugate complex pairs. So the expression may now be expressed in 
factors: 


x 2n — 2x n cos n(x + 1 


= n [* - “(■+v)][* v). 

= "n cos + ~~j + l], 


combining the conjugate pairs of factors. Thus the expression has been 
converted to the product of n real quadratic factors, and once again we have 
achieved a result in real algebra by the use of complex numbers. 


Further examples on the use of de Moivre's theorem 
Example 7 

Express cos 4 6 and sin 5 9 in terms of the sines and cosines of multiple 
angles. 


I 3 & COMPLEX NUMBERS 

We note that z = cis 0 => z _1 = cis( — 0). 

So z + z“ 1 = 2 cos 0 and z — z“ 1 = 2i sin 0. 
Also z" = cis >20 and z _ " = cis( — nO). 

So z" + z" M = 2 cos n0; z" — z”" = 2 i sin rc0. 


Hence I z + 


= z 4 + 4z 2 + 6 + 4z 2 + z 4 


Also 


(2 cos 0 ) 4 = (z 4 + z 4 ) + 4(z 2 + z 2 ) + 6 

= 2 cos 40 + 4 x 2 cos 20 + 6 
16 cos 4 0 = 2 cos 40 + 8 cos 20 + 6 

cos 4 0 = ^(cos 40 + 4 cos 20 + 3). 

lY 

= (z 5 - z" 5 ) - 5(z 3 - z" 3 ) + 10(z - z' 1 ; 

=> (2z sin 0 ) 5 = 2z sin 50 — 5 x 2i sin 30 + 10 x 2 i sin 0 

=> 32 sin 5 0 = 2 sin 50 — 10 sin 30 + 20 sin 0 (since z 4 : 

=> sin 5 0 = yg (sin 50 — 5 sin 30 + 10 sin 0). 

Example 8 

Find the sum of the series 

S = sin a + sin (a + /?) + sin (a + 2/?) + • • • + sin [a + (w — 

Letting C = cos a + cos (a + /?) + cos (a + 2p) 

4-h cos [a + (n - 1 )/?], 

we have C + iS — cis a + cis (a + P) + cis (a + 2 j 8 ) 

4 -f cis [a + (n — 1 )/?]; 

which may be written 

C + iS = cis a + cis a cis P + cis a cis 2p + • • • + cis a cis (« - 

= cis a[l + cis /? + (cis p) 2 + • • • + (cis /?)" _1 ] 

cisa[l — (cis/?)”] 


1 — cis p 
cis a (1 — cis nP) 


1 — cis p 
Using the result of p. 122 , this gives 
sin y np . 


being the sum of a g.p. 


using de Moivre’s theorem. 


C + iS = 


cis [a + \{n — 1 )/?]. 


sin y p 

Finally, equating real and imaginary parts, 
sin \np 


C = 


sin tt P 


cos[a + — 1 )/?]; 


5 = 


sin ^np . 


sin jp 


sin [a + |-(/2 


1 ) 


i)/a 


i)/» 


- i)/a. 
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In the special case when a = jS = 0, we get 

^ sin in6 , , , ., n 

cos 9 + cos 20 H-h cos rc0 = ——cos ii 71 + !)“• 

sin 2 


Exercise 10.4b 

1 Show that (1 + 3z ) 4 = 28 - 96z, and find the other fourth roots of 
28 - 96z. 

2 Find: 

(i) the cube roots of ^ (cos 60° + i sin 60°); 

(ii) the fifth roots of 4^2 cis (— 80°); 

(Hi) the eighth roots of cos 160° — i sin 160°; 

(iv) the ninth roots of i ; 

(t;) the four values of (1 — sfi i ) 3f4 ; 

(vi) the sixth roots of i — y/S ; 

(vii) (~i) 3/4 ; 

(viii) four values of z to satisfy z 4 + 1 = i. 

3 Solve the equations: 

(■ i ) z 3 + 8 = 0 ; 

(ii) z 6 + z 5 + z 4 + z 3 + z 2 + z + 1 = 0 ; 

(Hi) z 4 — z 3 + z 2 — z + 1 = 0 ; 

(iv) z 5 - 1 = 0 ; 

(v) z 6 + 22z 3 + 125 = 0. 

4 Solve the equations: 

(i) ( 2 z - l ) 5 = (z - 2 ) 5 ; (ii) (z + i) 4 = 81z 4 ; 

(Hi) z 6 — 2 z 3 + 2 = 0 (iv) z 9 + z 5 - z 4 — 1=0 

In Exercises 5 to 9 , take to to be a complex cube root of 1. 

5 Prove that: 

(0 (1 + co)(l + CO 2 ) = 1; (ii) (1 + CO 2 ) 3 = -1; 

(Hi) Simplify (1 + co)(l + 2co)(l + 3co)(l + 5co). 

6 Prove by reference to the Argand diagram that 1 + 2co 2 is purely 
imaginary. 

7 Form the quadratic equations whose roots are: 

(i) 3 + co, 3 + co 2 ; (ii) 2co - co 2 , 2co 2 — co. 

8 Prove that 1 + co" + co 2 " takes the values 3 or 0 according as n is or 
is not divisible by 3. 

9 From the identity 

a 2 + b 2 + c 2 — be — ca — ab = (a + bco + cco 2 ) (a + bco 2 + cco) 

(a, b, c € C), 
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show that if £ a 2 = £ be , then either a — r = co(r — b) or else 
a — c — oj 2 (c — b) and interpret in the Argand diagram. 

10 Express the following in terms of multiple angles: 

(i) cos 5 9 ; (ii) sin 6 9 ; (m) sin 7 9 ; 

(«0 sinP # cos 4 9 in the cases where (/>, ?) = (3, 4), (4, 3), (4, 6), (3, 7), etc. 

11 Use the results of no. 10 to find the integrals 


sin 6 9 d 9 and 
12 Prove that 


J. si 


sin 4 9 cos 6 9 d 9. 


1 4 cos a 4 cos 2a 4 * • ■ + cos (n — l)a = 


sin \not cos j(n — l)a 


sin fa 

, , sin a + sin 2 a 4 -h sin not 

and that -----= tan Un 4 l)a. 

cos a 4 cos 2 a 4 • • • -fi cos not 

Obtain formulae for cos a — cos 3a + cos 5 a-cos (2 n - l)a 

and sin a — sin 3a + sin 5a — • • • sin (2 n — l)a. 

13 Use the result of no. 12 to find a formula for the sum of the series 
sin a + 2 sin 2 a + 3 sin 3a + • • • + n sin not. 

14 Find the sum to infinity of the series 

S = cos 9 sin 9 + cos 2 9 sin 29 4 - cos 3 9 sin 30 + • • • 

(consider the g.p. £ cos r 9 cis r9). 


10.5 Euler’s equation: cis 6 = e ie 

Let us now consider the function y = cos x + i sin x . Continuing to 
treat i like any other number, we have, by differentiation: 

d y 


dx 


= —sin x 4 i cos x = z(cos x 4 i sin x). 


d y 

Hence — = iu 
dx 

. dx 1 

^ 1 ~A~ = _ 

&y y 

=> ix = In y 4 r. 

But when x = 0, y — 1. So r = 0. 
=> ix = In y 
=> y = *4 


So 


cis x = r* 
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It may be objected that we have nowhere defined the meaning of a number 
such as e z when z is complex; but the reader should not be deterred by such 
inhibitions! Indeed, the above paragraph may be regarded as providing, 
if not a definition, at least a reasonable exposition of the meaning of 
e lx , making it consistent with the familiar processes of mathematics. 
For example, the rule for the multiplication of complex numbers: 
cis 9 l x cis 0 2 = cis {6 l + 0 2 ) may now be re-stated: e ldl x e lBl = e l(ei+d2 \ 
which is just what we should expect in a well-behaved system, ‘imaginary 5 
powers of e obeying the same rules as real powers. Indeed, the rule for 
addition of arguments ( 10 . 2 ) when multiplying complex numbers is now 
seen to be no more than a disguised version of the rule for addition of 
indices, or logarithms ! 

Again, de Moivre’s theorem may now be rewritten in the perhaps more 
familiar form ( e ix ) n = e inx \ and, as further support for the truth of the 
statement e ix = cis *, it will be remembered that we obtained the following 
infinite series by using Maclaurin’s expansion: 


cos x = 1 


x x 

2! + 4! 


sin x = x 



( X 2 x 4 

Then cis x = cos x + i sin x = I 1 — — • 

( x 3 x 5 

+ T~3! + 5!~ 

(for all real x) 


2 -3 4 

x* ix x 

i + JX — — — — + — + ' 

(ix) 2 (ix) 3 

1 + (ix) + ~ + ^T- + ' 


2 ! 


3! 


= 


being the infinite series for e z , with 2 substituted by ix. This further en¬ 
courages us to accept the validity of the expansion of the exponential 
function as an infinite series in cases when the exponent is imaginary. 

The above is neither a complete nor a conclusive argument: a great deal of 
analysis, involving the use of the exponential series £ z = 1 + z + z 2 / 2 ! + • • •> 
is needed to make the definition of e lx watertight. Meanwhile, however, we 
shall proceed on the basis that the above broad outline makes sense. 

The most remarkable result may be obtained by putting x — n: 

e m = cis n = — 1 + Oi = — 1 , 

so we have derived the equation due to the great mathematician Euler: 


* i7£ + 1 = 0 , 


which unites in one spectacular relationship 
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the five most important numbers in mathematics: 0 , 1 , n, e and i. 

Sense or nonsense? It must be true! Certum est quia impossibile est: 
Nothing so incredible could be an invention! 


Example 1 


Integrals such as §e ax cos bx d* and §e ax sin bx dx are usually found by two 
applications of integration by parts, a tedious procedure. We now look at 
a way of avoiding this tedium by using the exponential form of the complex 
number, working the method in a numerical case. 


/» 


Let C = e 3x cos 2x d* and S = e 3x sin 2x d*. 


*4 


4 


Then C + iS = e 3x (cos 2x + i sin 2x) dx — e 3x e 2ix dx 


4 


I 


_ e ( 3 + 2i)x _ 


A~3 + 2i)x 


-3 + 2 i 


13 


e 2ix 



e 3x (cos 2x + i sin 2x). 


Separating the real and imaginary parts, we obtain C and S : 

e~ 3x e~ 3x 

C = —— (- 3 cos 2x + 2 sin 2x) and S = —— (- 2 cos 2x — 3 sin 2x) 
1 o 13 

(arbitrary constants have been omitted). These perfectly real results have 
been achieved by the use, not only of complex numbers, but of exponential 
functions of complex numbers. Their truth should be verified by direct 
differentiation. 


Exponential functions of the general complex number 

If z = x + iy , then = <4 4* = e* cis y 

=> \e z \ = e x and arg e z — y. 

Moreover, we may now say that In ( e x cis y) = x + iy , 
or In (r cis 9) = In r 4- z0-(+ 2 nni). 

For example, 

In ( — 3) = In (3 cis n) — In 3 + in. 

Note that the logarithm of a (complex) number has an infinity of values 
corresponding to the infinity of arguments, e.g., In ( — 3) = In 3 + 3ni, or 
In 3 — 57n, etc. 



io .5 euler’s equation: cis 6 = e ld 143 
As a further application, suppose we want to find i\ We may write 
i = cis = e l * n . 

So i l — (e* 1 *) 1 = e~* n « 0.207. . the value, surprisingly, being real. 

Trigonometric functions as exponential functions ; hyperbolic functions 

Since cos x + i sin x — e lx , 
and cos x — i sin x = e ~ lx , 


we deduce, by adding and subtracting, that 



and discover the astonishing revelation that trigonometric functions, which 
started life as ratios of sides of a right-angled triangle, are really exponential 
functions in disguise. Moreover, these exponential versions of the trigono¬ 
metric functions bear a striking resemblance to the definitions of the 
hyperbolic functions (see 9 . 9 ): 

e x + e~ x e x - e~ x 

cosh x =---> sinh * = —— * 

2 2 

Hence we may now state the following relations connecting the circular 
and hyperbolic functions, which the reader may verify by direct substitu¬ 
tion: 


cosh ix = cos x 

sinh ix = 

i sin x 

cos ix = cosh x 

sin ix — 

i sinh x 


We have already, in 9 . 9 , noted the close resemblance between trigono¬ 
metric and hyperbolic formulae, e.g., for cos (a + b) and cosh {a + b), 
and we are now in a position to see how this arises. 

For cosh(fl + b) = cos i(a + b) 

= cos ia cos ib — sin ia sin ib 
= cosh a cosh b — (i sinh a) (i sinh b ) 

=> cosh [a + b) = cosh a cosh b + sinh a sinh b , 

compared with 

cos (a + b) = cos a cos b — sin a sin b. 

More generally, we can summarise these resemblances in Osborn’s rule, 
that the formulae correspond exactly provided that the sign is changed 
whenever there is a product of two sine or two sinh functions. Thus, again, 
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the formulae 

cos 2x = 1 — 2 sin 2 x cos 2 x + sin 2 x = 1 

cosh 2x = 1 + 2 sinh 2 x cosh 2 x — sinh 2 x = 1 

sin 2x = 2 sin x cos x 
sinh 2x — 2 sinh x cosh x 


may all be quoted in support of Osborn’s rule. 

We may also see that our statements regarding sin 6 and cos 6 as expo¬ 
nential functions are consistent with the various formulae in ‘real’ trigono¬ 
metry. For example, sin (0 — </>) = sin 0 cos (j) — cos 0 sin (j>; and if we 
substitute the exponential versions in the R.H.S., we get 


e ie - e~ ie e i<f> + e' i<f> 
x - 


e ie + e~ ie e i<f> - e~ i<f> 


2 i 


2i 


= _ S e iO + i<i> _ e ~ie + i<i> _j_ e ie-i<f> _ e -i 0 -uf> _ gffl + i<\> _ g -iO + i<j> 

Ai 


+ e**-*+ €'*-**) 

= — {2e i(e ~ <t>) - 2e~ i(e ~ <t>) ) = 
4z 


e W~4>) _ e -M-4>) 

2i ’ 


which is the exponential version of sin (0 — (j)). A similar verification may 
be achieved with any of the trigonometrical formulae. 

With the above equipment, we may now even make sense of trigono¬ 
metrical functions of complex numbers: 
sin z — sin (x + iy ) = sin x cos iy + cos x sin iy 

= sin x cosh y + cos x(i sinh y) 

= sin x cosh y + i cos x sinh y , 

so that sin z is a complex number, and we have separated it into real and 
imaginary parts. 


Example 2 
Find cos -1 2. 

Suppose cos -1 2 = x + iy . 

Then cos {x + iy) = 2 

=> cos x cosh y — i sin x sinh y — 2 

=> cos x cosh y = 2 (i) 

and sin x sinh y = 0 («) 

From (ii), sin x = 0 or sinh y = 0. 

Case i 

sinh y = 0 => y — 0 => cosh y = 1 . 
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Substituting in (z), cos * = 2 . 

But x is real, so this is impossible. 


Case 2 

sin x — 0 => x = nn(n e Z) => cos x = (— 1)". 

Substituting in (z), (—l)"cosh y = 2. 

Buty is real, so coshy ^ 1 => n is even (= 2k), and coshy = 2. 

Hence x = 2kn and y = cosh -1 2 = ±ln (2 + yj 3) 
and finally cos -1 2 = 2kn ± i In (2 + ->/3). 

Example 3 

For a circuit containing resistance R, inductance L and capacitance C , 
in series, the differential equation satisfied by Q, the quantity of electricity 
which has passed at time t, is LQ + RQ + Q/C = V 0 cos cot (where 
V 0 cos cot is an alternating input E.M.F.). Suppose that for a particular 
circuit, the differential equation arising is 

Q + 6 $ + 25Q = 60 cos 5 1, 

and that we wish to find the current flowing at time t. 

Seeking first the complementary function (see 9.8), we obtain the 
quadratic equation k 2 + 6k + 25 = 0 , so that k — —3 + 4 i. 

Thus the c.f. is Q = A /“ 3 + 4l >* + B e ( ~ 3 ~ 4i)t = e~ 3t (A e 4it + B e~ 4it ) 

= e~ 3t [A (cos 4 1 + i sin 4 1) + B (cos 4 1 — i sin 4£)] 

= e~ 3t {C cos 4 1 + D sin 4 1), 

where C = A + B and D = i(A — B) are a fresh pair of arbitrary con¬ 
stants to replace the pair A , B. 

The particular integral will be of the form Q = L cos 5t + M sin 5£. 
Direct substitution back into the differential equation enables us to find 
the values of L , M , namely L = 0, M = 2, so the complete solution of the 
differential equation is 

Q — e~ 3t (C cos 4 1 + D sin 4 1) + 2 sin 5 £. 

Hence the current I = Q = e~ 3t (E cos 4 1 + Fsin 4 1) + 10 cos 5t , where 
E, F are yet another replacement pair of arbitrary constants. Their values 
could be found from two pieces of information, such as the initial charge 
and current. However, since the factor e~ 3t diminishes rapidly with time, 
this term, arising as the complementary function, soon ceases to be of any 
consequence, and is known as the ‘transient’. The term 10 cos 5^ gives the 
value of the permanent current resulting in the circuit from the application 
of the given alternating input. 
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Exercise 10.5 

1 Express in the form x 4- iy : 

(i) e ii7C ; (ii) e~ in ; (Hi) e~ 1+ * in ; (iv) e l — e~ l ; (v) sinh (2 — 5t); 
(vi) cos (3 4- i); ( vii ) In ( — 8); {viii) In 3 i; (ix) In (^3 4- i); 

(x) In (-73 - 0- 

2 Prove that e x ~ iy = (cosh x + sinh x) (cos y — i sin y). 

3 Using the exponential definitions of sin and cos, prove the formula 
sin (9 -f (f>) = sin 9 cos (j) + cos 9 sin </> and other trigonometrical for¬ 
mula. 


4 Use cis 3x = e 3lx to integrate e 2x cos 3x and e 2x sin 3*. 

5 If sin (x 4 - iy) = r cis 0, prove that r = \ (cosh 2 y — cos 2x) and 
tan 9 = cot x tanh y . 

6 Find the real and imaginary parts and the moduli of: 

{i ) sin z; (ii) cos z; (Hi) tan z, where z = x + iy; 

(iv) Express the real part of tan z as -—-— --- 

cos 2x + cosh 2 y 

7 Find z so that sin z = — 3. 

8 If sinh (# + iy) = find x and y. 

9 Express u — e x cos y, v = e x sin y as a single equation in z( = x + iy) 
and w ( = u + iv). Repeat for u = cosh x cos y, v = sinh x sin y. 


10 Given that tanh z = w, express tanh 2x and tan 2 y in terms of u and v , 
where z = x + iy and w = u + iv. 


11 Functions c(x) and j(^) are defined by the formulae 

c(x) = ^(e ix 4 - e~ ix ) and s(x) — ^ ( e lx — e ~ lx ), prove that 

[ c(x)Y 4 - [^(#)] 2 = 1 and that \_c(x)~\ 2 — K #)] 2 = c(2x). 

12 Give the general solution of the differential equations: 

d 2 y dy 

(i) d?- 8 £ + 25 ^ 0; 

d 2 ^ 

(«0 -^~2 = 6 “ 4x ’> 


d 2 i/ dy 
d 2 x 

(iv) + 9x = sin 4 1. 

v 7 d ^ 2 


13 Obtain the solutions of the following differential equations subject 
to the stated boundary conditions (dots indicate differentiation with 
respect to time): 

(i) x — 6 x 4- 10x = 0 given that x = 0 and x = 3 at t = 0; 

(ii) y 9y = 4 , where y = 0 , y = 2 at t = 0 . 
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*10.6 Transformations in the Argand diagram 

Suppose that z is a typical point of a configuration in the Argand diagram, 
and let z' = z + a, where a is a fixed complex number. 


z 



Translation: z' = z + a 



Every point of the configuration undergoes a similar displacement specified 
by the (constant) vector OA. The transformation z' = z + a therefore 
represents a translation. For example, z' = z + 2 — 3i represents a trans¬ 
lation which carries the whole figure bodily 2 units in the positive ^-direc¬ 
tion and 3 units in the negative y-direction. 

Consider next the transformation represented by z' = pz, where p is 
real and positive. 



Its effect is to multiply the modulus of z by a factor p, whilst its argument is 
unchanged. For example, if p = 3, the point 2 cis 50° would map into the 
point 6 cis 50°, while the point 2 — 5 i would map into 6 — 15z. Thus the 
transformation is an enlargement of the original configuration from the 
origin in the ratio p: 1, the image of the configuration being similar to the 
original. The transformation is known as an enlargement (or magnification , 
similarity , dilatation , homothety) even when p < 1; and when p = 1, we 
obtain the identity transformation. 
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We have already seen in 10.2 that the transformation z — z cis 0 has 
the effect of a pure rotation about the origin, since the modulus is un¬ 
changed, while the argument is increased by the angle 0 . 



The final image will be congruent with the original. Note that when 
0 = ^7T, we obtain z = iz, and we have already remarked that multiplica¬ 
tion by i may be interpreted geometrically as a positive quarter-turn. For 
rotation about a point other than the origin , say about the point A, remember¬ 
ing that vector AZ is represented by z — a, and AZ’ by z! — a , we have 

z — a = (z — a) cis 0 . 



For example, ifz' = iz + 4, we may write z' — a = i(z — a) 

4 

where a = ia + 4, so that a = --; = 2(1 + i). 

1 — 1 

Thus z' — 2 — 2i = *( z — 2 — 2i), 

which represents a rotation through 90° about the point 2 + 2z. 
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All the above transformations have been ‘direct’ in the sense that the 
clockwise or anti-clockwise sense of description of a figure is preserved in 
each case. A mirror reflection, however, is an ‘opposite’ transformation, 
the figure now being ‘turned over’. The simplest case is reflection in the 
real axis, given by the equation z' = z*. Reflection in the imaginary axis 
would be represented by z' = — z* and reflections in other lines will be 
given by equations of the form z' = az* + b (a, b € C). However, the 
latter equation does not always give a pure reflection; for in the simple case 
z' — z* 4- a, where a is real, there is a reflection in the real axis combined 
with a translation parallel to it. Such a transformation is known as a ‘glide 
reflection’. 



Again, z' = pz*, where p is real (# 1), includes a similarity as well as a 
reflection. 
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Composition of transformations 

Argand diagram methods provide a simple means of finding the result of 
combining successive transformations in the plane. For example, a rotation 
^7i about O followed by a translation +4 is represented by z' = iz + 4, 
and we have already seen that this is equivalent to a single rotation about 
the point 2 + 2 i. 



It should be noted that the two transformations do not commute. Fir if the 
translation were followed by the rotation, we would get z' = i(z + 4), 
which is a rotation about — 2 + 2 i. 
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On the other hand, an enlargement z' = pz in the origin, together with 
a rotation z* = z cis (j) ( also about the origin) do commute, giving the com¬ 
posite transformation z f = p cis </> x z; i.e., z = az , where a is complex. 
This type of transformation we have already met, as a spiral similarity. 

Next consider the enlargement z' = 3z in the origin, together with the 
half-turn z' — i — — (z — i) about i. If the enlargement is done first, we 
get 

z' — i = — (3z — i) => z = —3z + 2 i. 

But if they are commuted, 

z' — 3 [i + — (z — z)] => z — —3 z + 6 i. 

In the first case, we may write z' = — 3z + 2 i in the form 
z — a — —3 (z — a), where 4a = 2 i, 

giving a = \i\ i.e., z' — \i = — 3(z — ji) 9 which is a spiral similarity 
about the point \i. You should find the centre of the spiral similarity in the 
second case when the half-turn is done first. 

The above discussion opens up the whole study of plane transformations, 
and the interested reader may find a more thorough treatment of ele¬ 
mentary transformation geometry of the plane in an article by F. J. Budden 
in the Mathematical Gazette , Feb. 1969, pp. 132-158. Here we must content 
ourselves with leaving further work to the exercises, though we shall 
continue the development of the subject when dealing with functions of a 
complex variable in 10.8, where we shall be concerned with some simple 
cases of non-linear functions. 


Exercise 10.6 


1 Interpret the following transformations, and find the invariant points 
where possible: 

(i) z' = z + 2 — 4 i; (ii) z’ 

(iii) z' = 5z — 2 i; ( iv) z' 

(v) z' — (3 — 4z)z; (vi) z f 

(vii) z' + 2 = \[z + 2); (viii) z' 

(ix) z — iz — 1 . 


= 2 — 4z — z; 
= 2 iz; 

= (5 cis 100 °)z; 
= z(l-i); 


2 Write down an equation of the form z’ = / (z) to represent the follow¬ 
ing: 

(i) A translation +3 in the direction of the imaginary axis. 

(ii) A spiral similarity with ratio 3:1 and angle of rotation +135° 
about O. 

(iii) A spiral similarity with ratio 3:1 and angle of rotation +135° 
about i. 

(iv) A rotation through — jtz about the point 1 — i. 

(v) A half-turn about 4 + i. 
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(vi) An enlargement 1:4 in the point 2 + i. 

( vii) A magnification x 3 in the origin followed by a translation —2 in 
the direction of the imaginary axis. 

(mii) The same as in (vii) but in the reverse order. 

(ix) An enlargement in the ratio 2 :1 in the point i followed by a rotation 
through about the point 2. 

(.*;) The same as in (ix) but in the reverse order. 

3 Analyse the transformations: 

(i) z 9 = iz + 1 - 1; (*i) z 9 = iz* + 1 - i. 

4 By using complex numbers show that successive half-turns about 
a and b are equivalent to a translation 2 AB. 

5 Interpret the following transformations: 

(i) z' = z*; (ii) z' = iz*; (Hi ) z' = z* cis 2a; 

(iv) 2' = iz* - 2; (v) z' = i(z* - 2). 

* 10.7 Loci in the Argand diagram 

As a complex number is an ordered pair of reals, it may be used to repre¬ 
sent any two-dimensional vector, such as a displacement, or a force, or an 
impedance operator in the theory of alternating currents. Such vectors 
are subject to change: a particle moves in a plane so that the complex 
number representing its position traces out a curve, the resistance or the 
frequency of a circuit is altered with the result that its impedance is altered, 
and the corresponding points on the Argand diagram move. We shall 
therefore be concerned with the locus of a point 2 in the Argand diagram, 
moving subject to certain conditions. The equations of such curves may in 
certain cases be expressible algebraically in terms of 2. For example, a 
circle centre O and radius p may be written 
\z\ = p or zz* = p 2 , 

while the half-line through O making angle 6 with the real axis has equation 
arg 2 = 0 . Similarly, the equation arg (2 + 2 i) = 300 ° represents the 
half-line terminating at the point -2i and proceeding in the direction 
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The equation of any circle, centre A and radius p , may be written 



This can be rewritten in the usual Cartesian form, 
x 2 + y 2 + 2 gx + 2fy + c — 0 
by replacing z and z* by x + iy and x — iy. 



such an equation expressing the geometrical property that the point Z 
moves so that the ratio of its distances from two fixed points A and B is 2, 
the locus under such conditions being called the ‘circle of Apollonius’. 
Likewise, the equation * 



a 
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represents a circular arc terminated by the points A, B; for arg - — - 

is the angle of turn from BZ to AZ , and so the equation expresses the 
property that this angle is constant. In our figure we show the cases where 
</> = 50°, 130°, -130°, and -50°. 



We may invent other examples of loci based upon our geometrical 
knowledge; e.g., the locus of Z so that AZ + BZ = constant is an ellipse, 
and this may be expressed in complex numbers as |z — a\ + |z — b\ = X 
(Xe R + ). Again, the inequality |z| < 1 represents the whole of the interior 
region of the unit circle; while 1 < |z + i\ ^ 3 represents the region 
lying between the circles centred at the point —i with radii 1 and 3, includ¬ 
ing the circumference of the latter circle. 

Exercise 10.7 

1 Describe the loci when z moves in the Argand diagram subject to the 


following conditions: 



(0 

M = 2 ; 

(it) 

\z - l\ = \z+ 11 ; 

(in) 

jz - t| = \z+ l|; 

(iv) 

z = z*; 

(») 

arg(z - i) = -jn; 

(vi) 

|z| + |z — 1 + i\ = 2 ; 

(vii ) 

z - l 1 

arg z+ i=^ ; 

(viii ) 

|z| = 3|z- 11; 

(w) 

arg(z - 1 ) - arg(z + i) = n; 

(*) 

|z- l| + |z+ 1| = 4. 


2 Shade the regions of the Argand diagram for which z is subject to the 
following restraints: 

O') |z| < 3; (it) |z + 2| > 2; (Hi) \z + 4i| ^ 3; 

(iv) \z — 3 + i\ > 2; 0) |z—l| + |z+l|^4; (vi) 

3 From the equation |z + 16| = 4|z + l|, deduce that |z| 
interpret geometrically. 



= 4, and 
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4 Show that | z + 2i\ = |2 iz — l| represents a circle, and find its centre 
and radius. 

5 Express |z/(z — i)\ = 4 using z and z* instead of modulus signs. Repeat 
for |(z — a)/(z — b)\ = X. 

6 If z moves along the semicircle connecting the points i, 1 and — i, 
describe the loci of the following: 

(£) 3z; (ii) —2z; (iii) iz; (iv) 2 + iz; 

(v) 2 z*; (vi) z 2 ; (t>££) z 2 — 1 ; (viii) z 1/2 . 

7 If z moves along the real axis from — 1 to +1, find the locus of 

(1 - iz)/(z - i). 

8 If (z — i)/(z — 1) is purely imaginary, show that the locus of z is a 
circle centre ^(1 + £), radius 2 ^/ 2 . 

9 If z describes the semicircle from the point 1 through the point i to the 
point — 1 , and thence the part of the real axis from — 1 to + 1 , describe 
the locus of: 

(£) z + £; (ii) 2 z; (iii) iz; (iv) z 2 ; (v) 1 /z. 

10 Ifz describes the square 0 , 1 , 1 + £, i, 0 , describe the locus of: 

(£) z 2 ; (ii) e z ; (iii) cosh z. 


*io.8 Functions of a complex variable 

When x is a real variable, the function f (x) is usually illustrated by a 
graph. Alternatively, however, it can be represented as a mapping from a 
set of values of x (its domain) onto a set of values of f (*) (its range) by an 
arrow diagram. 
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For instance, 

f:x + 1 

can be represented by 




If z is a complex variable, the function f (z) can no longer be shown in 
the first of these ways, as a graph, and has to be shown as a mapping, this 
time from a two-dimensional set of points (its domain) in one Argand 
diagram onto a set of points in another Argand diagram (its range). 



So, just as we can speak of y = f {x) as a mapping/: x \-+y of points from 
an .x;-line into a y-line, so we can also speak of w — f (z) as a mapping 
/: z h-> w of points from a z-plane into a w-plane. 

For example, the function w = 2z H- 3 — i maps points as follows: 



0 

U 

I 

j 

V 

W 

A 

B 

K 

z 

0 

1 

i 

— i 

-1 

1 + i 

2 

-2 

— 1"2 + j>i 

w 

3 — i 

5 — i 

3 + i 

3-3? 

1 -i 

5 + i 

1 -i 

-1 -i 

0 


O' 

U' 

r 

J' 

V' 

w 

A' 

B' 

K' = O 
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^ z-plane 
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But it is rather simpler in this case to think of the 2 -plane and w-plane as 
being superimposed, and to regard w = 2z + 3 — i as carrying the 
point 2 to a new position, first by the enlargement in the origin z' = 2z, 
followed by the translation w = z' + 3 — i. If z belongs to a configuration, 
then the whole of the configuration is carried by the transformation to the 
new position. 



Such mappings are extremely important when the theory of complex 
numbers is applied to physical problems, ranging from the investigation 
of streamlines of fluids to that of lines of electric force and equipotentials, 
and to the study of the behaviour of oscillating circuits. Within mathe¬ 
matics itself, functions of a complex variable are equally important: 
exploration of what meaning can be given to such ideas as continuity, 
differentiation and integration has led to one of its most fertile areas. But 
this is a journey which cannot be undertaken here, and we confine ourselves 
to investigating a few examples. 
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The general linear function 

It is clear that the general linear transformation 


w = az + b (a, b e C) 
or w = (p cis 0 )z + b (where a — p cis 0 ) 

may be analysed into basic elementary steps by splitting the formula as 
follows: 

Zj = pz an enlargement in O] 

. , __ > spiral similarity 

z 2 = Zj cis 0 = az a rotation about O J 

w — z 2 + b = az + b a translation. 

Thus the general linear transformation may be composed of enlargements, 
rotations, and translations, and so must preserve straight lines. When 
p = 1 , it will preserve distance (and so be an isometric transformation); 
when p + 1 the image will be similar to the original figure. The example 
w — 2z + 3 — i was such a transformation, though the element of rota¬ 
tion was absent because the value of a (= 2 ) was real (so that 0 = 0 ). 

The reciprocal function , w = l/z 

The function l/z is immediately seen to be quite different in that it does 
not preserve straight lines. We may see this easily by considering the three 
collinear points 

U(1 +00, S (1 +0, T (1 - 0, 
which have as images: 

U'(l + 0«-), S'1(1 -i), Ti(l+i); 
and these do not lie on a straight line. 


z-plane A w-plane 



The transformation represented by w = l/z may be investigated geo¬ 
metrically, but this requires a knowledge of inversion and we content our¬ 
selves with a simple case, using analytical methods. 
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Example i 
1 


If w = -, where w = u 4 iv and z = x 4 iy , 
z 


then u 4 iv = 


so that u = 


1 x — iy 

. o 9 * 

* 4 iy x+y 

Y 

and 


x 2 4 y 2 


v = — 


* 2 + */ 2 


Consider a line u = constant = A in the w;-plane. 


Since m = 


x 2 -f y 2 


= A, we have 


^ 2 + y 2 — -x = 0, and this is a circle in the z-plane. 

Similarly the line y = fi in the z-plane will lead to the circle locus whose 
equation is 

u 2 + v 2 - v = 0 in the w-plane. 

A* 

Example 2 

If z traces out the circle |z| = 1 anti-clockwise from U back to U, find 
the locus of w = 


z + 1 

This could be done by writing w — 1 —. 
down into 


and breaking this 


= z + 1 , 




z + 1 


Zq — 2 Zn — 


z + 1 


W = Zq 4 1 = 1 — 


z — 1 


z 4 1 z 4 1 


z 4 1 

a translation; 

the reciprocal transformation; 
a half-turn and enlargement; 
a translation; 


and then finding the geometrical effect of each of these on the unit circle. 
Or we may take z = cis 0, so that 

cis 6 — 1 cis iO — cis (— iO) 2 i sin . _ 

^ ' — _ t— — * tan —n 

cis 6 4 1 cis ^6 4 cis(— \6) 2 cos-^-0 2 

Thus, as 9 goes from 0 to 7t, w traces out the imaginary axis from O to 

infinity; whilst as 6 goes from n to 2n, w traces the other half of the imaginary 

axis from — oo up to the origin. 



l6o COMPLEX NUMBERS 


There is, however, a simple geometric solution. For if U, V are the 
points 1,-1 (our usual notation), then z — 1 and z + 1 are represented 
by the vectors UZ , VZ . The fact that these are perpendicular (Z_UZV being 
the angle in a semi-circle), means that their quotient w must be ‘purely 
imaginary’, or a real multiple of i. Hence w lies on the imaginary axis. 

Alternatively we may rearrange the formula as z = ———. Since 

1 — w 

|z| = 1 , it follows that |l + w\ = |l — w\, so that w is equidistant from the 
points — 1 and + 1 , and therefore lies on the imaginary axis. 


Example 3 

If w = z 2 , find the locus ofz when w traces the lines u = constant,^ = con¬ 
stant. 

u T iv = (x + iy ) 2 = x 2 — y 2 + 2 ixy\ 
so u = x 2 — y 2 , v = 2 xy. 

Hence u = constant gives a family of rectangular hyperbolas, 

X 2 - y 2 = 1 (e R), 

v = constant gives another family of rectangular hyperbolas, 
xy = \i (efi). 

All mutual intersections are perpendicular and the families are said to be 
orthogonal. 



Example 4 

Express u = e~ x sin z/, v = e~ x cos y as a relation between w and z. 

w = u + iv = e~ x (sin y + i cos y) = e~ x i (cos y — i sin y) 

= ie~ x e~ iy = ie~ z . 
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Polynomial functions: the fundamental theorem of algebra 

Earlier in this chapter the reader may have had the uneasy feeling that 
there may just possibly exist a certain polynomial function of the form 

/(z) = a 0 z n + fljZ " -1 -I- b a n _ l z + a n 

with the property that there is no value of 2 whatsoever which makes 
f (z) = 0 . Certainly there are quadratic equations which have no solutions 
in the field of real numbers, and it might reasonably have been supposed 
by analogy that there are also polynomial equations which have no solu¬ 
tions even in the much wider field of complex numbers. The reader should 
not be ashamed of harbouring such doubts, for the same question troubled 
many mathematicians for ages. It was finally resolved, and doubts dis¬ 
pelled, by the so-called fundamental theorem of algebra , which states that for 
every polynomial f (z) there does indeed exist a value of z (e C), which 
satisfies the equation f (z) = 0 . In other words, every polynomial equation 
has at least one root in the field of complex numbers, and this is still true 
even when the coefficients, a 0 , a l , . . ., a n are themselves complex. So the 
theorem, which is an example of what mathematicians call an existence 
theorem , is really a very powerful result; and, once proved, it is a simple 
matter to prove further that a polynomial of degree n has n roots in complex 
algebra (repeated roots being counted according to their multiplicity). But 
the proof of the fundamental theorem of algebra is difficult, involving 
sophisticated analytical concepts. For readers who wish to pursue the 
matter, a traditional proof may be found in Hardy, G. H., A course in pure 
mathematics (Appendix II) or in Dickson, L. E., First course in the theory of 
equations (Appendix). But more modern methods of algebraic topology 
enable the proof to be considerably shortened. 


Exercise 10.8 

1 If w = (z — 1 )/(z T 1 ) (as on p. 159), show that when w traces out the 
lines u = constant and v = constant, the locus of z is two systems of 
circles. Find also the locus of w as z traces the upper half of the circle 
diameter — 1, + 1. 

2 If w = (z — z)/(z + i) and lies below the real axis, show that w lies 
outside the unit circle \w\ = 1 . How will w move as z travels along the real 
axis from — 00 to + 00 ? 

3 If w = (z — 6z)/(z + 8), show that the locus of z is a circle when w 
traces the imaginary axis, but a line when w traces the real axis. Find the 
equations of the z-loci. 

4 Examine the effect of the transformation w = (z + 2 )/(z + i) on the 
system of concentric circles having centres at the origin in the z-plane. 
What is the image of the exterior of the circle |z| = 1 ? 
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5 If w = (z — a)/(z — b) verify that w is obtained by making triangle 
0,1 , w similar to triangle z, b , a . 

6 If w = (z + l) 2 and z traces the unit circle, show that the locus of w 
has polar equation r = 2(1 + cos 6 ) ( a cardioid). 

7 Show that the lines x = constant and y = constant in the z-plane are 
transformed by w = cosh z into sets of confocal ellipses and hyperbolas in 
the w-plane. 

8 What is the effect of w — e z on the system of lines x = constant, 
y = constant in the z-plane? 

9 If w = 4 /(z + l) 2 and |z| = 1 , show that w describes the parabola 
whose polar equation is r = sec 2 \ 0 . Show that the interior of the circle 
maps into the exterior of the parabola. 

z + 5 

10 Find the invariant points of the transformation w = ----— 

F 2 + i(z — 1) 

11 If (1 — z) (1 — w) = 1 and arg z = find the locus of w. 

12 What is the effect of w = cos z on the lines x = constant, y = constant 
in the z-plane? 


Miscellaneous problems io 

1 Solve the equation z 4 + 10z 2 + 169 = 0 by first solving for z 2 , and 
then for z in the form a + ib where a and b are real. Hence or otherwise, 
express z 4 + 10z 2 + 169 as the product of two real quadratic factors. 

1 1/1 1 \ 

2 Show that for real x, - -5- — — I-:-: • 

1 + x 2 i\x — 1 x + 1/ 


Deduce that 
d"" 1 / 1 


dx n 


1 + 


1 


= (-I)"' 1 (n ~ 1)!^ 


1 


1 


2 i\(x - i) n {x + i) n 


Prove that, if x — i = r (cos 6 — i sin 0 ), 

then (x — i)~ n = r~ n (cos n 6 + i sin n 9 ), 

and obtain a similar expression for (x + i)~ n , 
Hence obtain a formula for 


1 


in terms of r and 0, 


dx n ~ l \l + ^ 2 
and deduce that for 0 < x < ^7t, 


^(tan 1 *) = (-!)" 1 ( J W + Jl/2 sin (” cot 1 x )- ( M ' E - 1 -) 
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3 What geometrical transformation is given by 

2 ! — c = (z — c) (cos 6 + i sin 0 ), where c is a complex constant? 

If the transformation z t-> z corresponds to a turn of 120° about the 
point ( 1 , 0 ), and the transformation z' z" corresponds to a turn of 60° 
about the point ( — 3, 0 ), show that z" — — z + 2^/3z, and hence express 
in geometrical terms the single transformation equivalent to the two 
transformations in the order given. (s.m.p.) 

4 (i) Given that one root of the equation z 4 — 4 z 3 + 12z 2 + 4z — 13 = 0 
is 2 — 3 i, find the other three roots. 

(ii) Given z = 2 (cos n/6 + i sin n/6) illustrate on the Argand diagram the 
points representing the complex numbers z, z*, (z*) 2 , (z*) 2 /|z|. 

S is the set of points in the first quadrant which lie on the circumference 
of the circle |z| = 2. Find the image of£ under the mapping z i-» (z*) 2 /|z|. 

(m.e.i.) 

5 The vertices of a triangle in the Argand diagram are given by the com¬ 
plex numbers 0 , b , c. Say whether the following are true or false, with 
reasons: 

(i) The centroid is'at the point \(a + b + c). 

(ii) The area of the triangle is^[(£ — c) 2 + (c — a) 2 + (a — b ) 2 ~\. 

(in) The triangle is equilateral if and only if 

(a — b) (a — c) + (b — c) 2 = 0 . (o.s.) 


6 Let f (z) = \(z + z _ l ) where z is a non-zero complex number. 

(■i) If z describes a circle |z| = r where r is a positive real constant, 
show that if r / \,f(z) describes an ellipse in the complex number plane. 
What happens if r = 1 ? 

(ii) If z describes the half-line arg z = 0 , where 6 is constant, prove 
that, in general (i.e., if certain exceptional values of 6 are excluded) 
f (z) describes one branch of a hyperbola. 

(in) What are the exceptional cases in (ii)? Describe what happens for 
these exceptional cases. (o.s.) 

7 If |z — l| = 1 , prove that arg(z — 1 ) = 2 arg z = f arg (z 2 — z), 

and give a geometrical interpretation of these results. (o.s.) 


8 Show that the roots of (z + i ) 5 = (z — i ) 5 are cot kn /5 (k — 1, 2 , 3 , 4 ), 
and hence evaluate cot 2 7c/5 + cot 2 27 t/ 5 , and cot 7t/5 cot 27 t/ 5 . Generalise 
the result and thence obtain 


n 


X c °t 2 


kn 

2 n + 1 


= $«( 2 » - 1 ). 


9 Prove that the roots of 
(z + 1)" + (z - 1)" = 0 are 


. ( 2 k + l)n 

1 cot- 


2 n 


(A = 0,1,2,..— 1). 
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Deduce that 


"- 1 ( 2 k + 1 ) 7 T J V 2 ( 2 k + 1)71 2 

V cot -= 0 and Y cot 2 -——-= rr — n. 

7 2n 7 

10 Show that 


2 n 


(1 + x) 2 " + (1 - x) 2n = 2 n( X 2 + tan 


( 2 k 


1 )7T 


4 n 


and prove that 

( 2 k - \)n 


sec 


4 n 


= 2 n 2 . 


11 Prove that e 2knl = 1 for all integral values of k , and that e ** 1 = i. 

Deduce that a value of i l is e~^ n , and using the result e 2kni = 1, find a for¬ 
mula for all possible values of i l . (c.s.) 

12 If cos z ~ w, prove that 


cos 2 x sin 2 x 


= 1 and 


= 1 


cosh 2 y sinh 2 y 
and discuss interpretations in terms of the Argand diagram. 


13 For a rotation through a and about O, we have 
z f = z cis a or x' + iy ' = (x + iy) (cos a + i sin a). 

Equating real and imaginary parts gives: 

x' = x cos a — y sin a) , . /cos a — sin a 

>, so we get the rotation matrix 

y = *sinoe + z/cosocj \sina cos a 

Show that reflection in the line y = x tan a can be represented 
z' = z* cis 2a. 

By equating real and imaginary parts, find the matrix for this reflection. 



II 


Further vectors and mechanics 


11. i Points of subdivision 

Suppose that two masses are attached to the ends of a light rod AB, 2 kg 
at A and 3 kg at B. 


A 


2 * 

a 


B 


b 


3 


It is found from elementary experiments that such a rod will balance at 
the point P, 

where 2 AP = 3 PB, 

so we call this the ‘point of balance 5 . 

If we now denote the position vectors (from an origin O) of A, B, P by 

a, b,p , 

then 2 {p — a) = 3(6 — p) 

=> bp = 2 a + 36 

2 a + 36 


P = 


5 
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More generally, let us consider two masses, a at A and at B. 

A p B 

« *- 1 -• P 

a p b 


Then their ‘ point of balance ’ would be at P, where 
ocAP = iBPB. 

So oc(p — a) = 18(6 — p) 

=> (a + P)p = oca + fib 
oca + fib 

p = 

It is clear that only the ratio a:/? is important. For if a and (3 were both 
multiplied by the same number K, 

kolAP = KpPB => = f$PB 

. Koca + Kpb oca + fib 

and p = - = -5 

KCC + Kp OC + P 

so that the position of P is unchanged. 

When, however, the ratio changes, P will move along the line AB; and 
if either of the numbers a, jS were allowed to be negative, P would fall 
outside the segment AB and be either beyond A or beyond B. 


Example i 

(0 oc = -%p = + 3 . 

So — 2 AP = 3 PB and P is beyond B. 

-2_+3 

--* 

A B P 

Furthermore, p 

(ii) a = + 2,/J = 

So 24 P = -PB 

2 

P A B 


— 2 a T 36 
-2 + 3 


= — 2 ci -f 36 . 


- 1 . 


and P is beyond A. 
-l 


Furthermore, 



6 . 


The only exceptional case is when oc — —ft. 

Then ocAP = -ocPB => AP = -PB , 

which is impossible when A, B are distinct 

oca ~ cub oca — ocb , . . . . . N 

(and p — -— - 9 which is meaningless) 

a — a 0 
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1 If A, B and P have position vectors a, 6 and p, find p in terms of a and b 
and illustrate by a sketch when: 

(i) AP = PB; (ii) ZAP = PB; (Hi) AP = 3 PB; 

(iv) ZAP = —PB; ( v) AP = -3 PB. 

2 If A, B have position vectors a, b , illustrate the points whose position 
vectors are: 

(0 ja + }b; {ii) fa + jb; (iii) fa + ffr; 

(iv) \a + (z;) 2 a — b; (vi) Za — 2b; 

(vii) 4a — 36; (wii) 26 — a; (zx) 36 — 2a; 

(*) 46 — 3a. 

3 The points A and B have position vectors 6 i — 5 j and 4 i — Zj res¬ 
pectively. Show that the mid-point M of AB is collinear with the two 
points X, Y whose position vectors are 2 i — 6 j and 1 It respectively, (o.c.) 


Points of subdivision 
If ocAP = fiPB, 
then a AP = fiPB 
AP _ j 8 
PB~ a’ 


and 


so P is said to divide AB in the ratio P : a. 


Example 2 

(*') a = 2, P = 3. 

So 2AP = 3P5 

=> - = i 

PB 2 

and P divides AB in the ratio 3:2. 
{ii) a = - 2, = 3. 

So 24P = -3PP 

AP 

=> — = —4 

PP 2 

and P divides AB in the ratio 3: — 2. 




A 



a 




2 
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(Hi) cl = 2, p = — 1. 


So 


— 2AP = 
AP _ 
PB ~ 


PB 


1 

2 





- 1 


and P divides AB in the ratio —1:2. 

Summarising, we see that the following statements are all equivalent: 

P is the centre of mass of a at A and ft at B 
clAP = (}PB 

oca + (3b 

p= -VTT 

P divides AB in the ratio /?: a 


Example 3 

Find the position vectors of the points P, Q which divide AB in the ratio 
1 :2 and —5:2 respectively. 

(i) P divides AB in the ratio 1:2 

=> P is the 4 point of balance 5 of 2 at A and 1 at B 


P = 


2 a + 16 

2 + 1 


ia + 4-6. 


(ii) Q divides AB in the ratio 5: — 2 
=> Q is the ‘point of balance’ of —2 at A and 5 at B 
— 2 a + 56 


-2 + 5 


= -fa + f b. 


Exercise 11.1b 

1 The vertices of AABC have position vectors a, 6 , c. Find, in terms of 
a, 6 , c , the position vectors of 

(i) D. E. F. the mid-points of BC, CA, AB respectively; 

(ii) the point X which divides the line AD in the ratio 2:1, and the cor¬ 
responding points Y, Z dividing BE, CF in the same ratio. 

Hence show that the medians AD, BE, CF are concurrent (at a point 
called the centroid of AABC). 

2 The vertices A, B, C of AABC have position vectors a, 6 , c. Find, in 
terms of a, 6 and c, the position vectors of 

(i) P, which divides BC in the ratio 3:2; 

(ii) Q, which divides CA in the ratio 1:3; 

(Hi) R, which divides AB in the ratio 2:1; 
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(iv) X, which divides AP in the ratio 5:1; 

(u) Y, which divides BQ in the ratio 2:1; 

(yi) Z, which divides CR in the ratio 1:1. 

Hence show that AP, BQ, CR are concurrent. 

3 In no. 2 find also the position vectors of 

(i) U, which divides BC in the ratio —3:2; 

(ii) V, which divides CA in the ratio —1:3; 

(Hi) W, which divides AB in the ratio —2:1. 

Hence show that: 

U also divides QR in the ratio —3:4; 

V also divides RP in the ratio —5:3; 
and W also divides PQ in the ratio —4:5. 

4 If the points A, B, C have position vectors a, 6 , c from an origin O, 
show that the equation 

r = ta + (1 — t)b, 

where t is a parameter, represents the straight line AB, and find the equa¬ 
tion of BC. 

Find the equation of the straight line joining L the mid-point of OA to M 
the mid-point of BC. Find also the position vector of the point in which the 
line LM meets the straight line joining the mid-point of OB to the mid- 
\ point of AC. (l.) 

5 OABC is a square of side 2 a. i,j are unit vectors along OA, OC. The 

mid-point of AB is L; the mid-point of BC is M; OL, AM meet at P; BP 
meets OA at N. Show that the segment OP can be measured by the vector 
>1(2 ai + aj) and also by the vector 2 at + fi(2aj - ai). Hence determine X 
and /i. Prove that ON — § OA. (o.c.) 


11.2 Centres of mass 

In the last section, when masses a and /? were placed at two points whose 
position vectors are a and 6, we referred to 



as their ‘point of balance’. 

Let us now suppose that we have a number of masses: 
m l , m 2 , m 3 , m 4 . . . 
at r x , r 2 , r 3 , r 4 . . . respectively. 




Let G be the point which has position vector r, where 

_ m x Y x + m 2 r 2 + m 3 r 3 + m 4 r 4 + • • • 

r — --- 

m x + m 2 + m 3 + m 4 + • • • 

Now G is a definite point, but there is a serious risk that its position will 
depend on our choice of origin, O. 

To discover whether or not this is so, we must take another origin O' and 
see whether such a calculation based on O' leads to a different point G'. 
Let us suppose that from the new origin O', O has position vector c, and 
m x , m 2 , . . . have position vectors r \, r 2 ,. . . . 



So the position vector of G', calculated from O', is 
m x r\ + m 2 r 2 + m 3 r 3 + • • • 

Y — -- 

m l + m 2 + m 3 + • • • 

_ m 1 (c + Yj) + m 2 [c + r 2 ) + m 3 (c + r 3 ) + • - - 
m x + m 2 + m 3 + • • • 

(m x + m 2 + m 3 + • • *)c + + m 2 r 2 + m 3 r 3 + * * 0 

m x + m 2 + m 3 + • • • 

m x Y l + m 2 Y 2 + m 3 Y 3 + • • • 

= C H-;- 

mj + m 2 + m 3 + • • • 

So y' = c + Y. 

Hence OG ' = O'O + OG = O'G 
=> G and G' are identical. 

So if masses m x , m 2 , m 3 . . . 


are at points y x , r 2 , r 3 . . ., 
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the point 


m l r l 4 m 2 r 2 4 m 3 r 3 4 
m x 4 m 2 4 m 3 4 • • • 


is not dependent on our choice of origin, but only on the given system of 
masses, and is therefore called their centre of mass. In the particular case 
when the masses are all equal this becomes 

r = -(r, + r 2 + ■ ■ ■ + rj, 


which is also known as the centroid of the given points. So the centroid of 
r x and r 2 is at \{r x 4 r 2 ), and of r x , r 2 , r 3 is at \{r x 4 r 2 4 r 3 ). 
Finally, in terms of components, the centre of mass is 


xi + yj + zk — 


m 1 (x l i + yj + z t k) + m 2 {x 2 i 4 yf + z 2 k) + • 
m l + m 2 + • • • 

. . miVi + m 2 y 2 + • • • . 

t H-r-;- j 


m l x l + m 2 x 2 4 ■ 


4 


m x 4 m 2 
m x z x 4 m 2 z 2 4 • • 


m x 4 m 2 4 ■ 


m x 4 m 2 4 • • • 
So the centre of mass has coordinates 


k. 


_ m x x x 4 m 2 x 2 4 
x — 


m x 4 m 2 4 • • • 
m x y x 4 m 2 y 2 H- 

y = - 

m x 4 m 2 4 • • • 

_ 4 m 2 z 2 4 • • 


or 


X mx J] my ^mz 
-, -, - 


m x 4 m 2 4 ■ 


Example i 

A light rectangular frame of length 4 m, breadth 3 m and height 2 m has 
masses 1 kg, 2 kg, 3 kg attached at corners as shown in the figure. Where 
is their centre of mass ? 
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Taking unit vectors i,j, k as shown, the three masses are: 

1 kg at the point 4 i + 3 j 

2 kg at the point 4i + 2k 

3 kg at the point 37. 

So their centre of mass is at 


_ 1 x (4 i + 3j) + 2 x (4 i + 2 k) + 3 x 3 j 

1+2 + 3 


12* + 12 j + 4 k 


= 2* + 2 j + f*. 


Alternatively, taking axes Ox, Oy, Oz in the same directions as *, j, A:, 
the calculation could be set out: 


m 

X 

y 

z 

mx 

my 

mz 

1 

4 

3 

0 

4 

3 

0 

2 

4 

0 

2 

8 

0 

4 

3 

0 

3 

0 

0 

9 

0 

'Em = 6 




X mx = 

12 Y.my = 12 

J] 772Z = 4 

So the centre of 

mass has coordinates 



_ X mx 

0 . 




X>Z 2 


x — 


y = 

^ =2 

; ^ = 

I« 3 



and is therefore at the point (2, 2, f). 


Example 2 

If a square of side a is removed from the corner of a uniform square lamina 
of side 2 a, where is the centre of mass of the remaining piece? 
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We can regard the remaining piece as being composed of a large square of 
mass 4 together with a smaller square of negative mass — 1. So, using 
axes Ox , O y as shown, we have: 




m 

X 

y 

mx 

my 

Large square 


4 

a 

a 

4 a 

4 a 

Small square 


-1 

\a 


—\a 


Final lamina 

X 

m = 3 



X mx = \a 

Y.my = fa 

e _ T.mx 

la 


Z my _ 

la 



A m 

IT 

’ y = 

Z m 

6~ 



and the centre of 

mass lies at (7a/6, la/6). 




Exercise 11.2a 

1 If the vertices of a tetrahedron ABCD have position vectors a, 6 , c, d , 
find the position vectors of 

(i) the centroid P of ABCD; 

(ii) the point dividing AP in the ratio 3:1. What do you deduce about 
this and the three other similar lines ? 

(iii) the mid-point of the line joining the mid-points of AB and CD. 

Hence find seven lines associated with the tetrahedron which are all 

concurrent. At what point do they meet? 

2 If G, G' are the centroids of AABC, AA'B'C', prove that 

AA + BB' + CC = 3 GG\ 

3 Find the centres of mass of 

(i) 1 kg at a, 2 kg at b, 3 kg at c\ 

(ii) 1 g at (0, 0, 0), 4 g at (1, 4, 2), 3 g at (3, - 1, -2), 2 g at (-4, 1, 6 ); 

(iii) 1 kg at i + 2/, 4 kg at j — 2 k, 5 kg at — i + 3 / + k. 

4 If a cube of side 1 m is removed from the corner of a uniform solid 
cube of side 2 m, how far has its centre of mass been displaced? 

5 A mass of 2 kg is placed at the point (4, 1). Find the masses which 

should be placed at ( — 1 , 3 ) and ( — 2 , — 1 ) in order that the centroid of 
the three masses should be at the origin. (s.m.p.) 


Continuous distributions of mass 

So far, we have limited our investigations to systems consisting of a finite 
number of discrete masses. When there is a continuous distribution of mass, 
the method is very similar, except that we shall usually begin by dividing 
the given body into a series of elements. 
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Example 2 

Uniform triangular lamina 



We first divide the lamina into a set of elemental strips parallel to one of its 
sides. Now the centre of mass of each strip is at its mid-point, and these are 
all collinear on a median. Hence the centre of mass of the lamina lies on 
this median, and similarly on the other two medians. It is therefore situated 
at the centroid of the given triangle. 

Example 3 

Find the centre of mass of: 

(i) A uniform lamina (or plate) in the shape of the area between y = x 2 , 
y = 0 and x = 1. 

(ii) A uniform solid formed by rotating this area about the x-axis. 

(i) Uniform lamina 



Suppose that the lamina has surface density (i.e., mass per unit area) <r, 
and that it is split into narrow strips of width bx and height y. 

Then the strip shown has mass m — ay bx which can be regarded as 
concentrated at the point (x, |y). 
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_ X mx Z {°y $ x ) x Z x y $ x 
S ° Z rn Y,i°y Sx ) Z y & x 

XJ dx fo X 3 dx = { = 3 

V 0 y Ax )l x2Ax 3 

_ = l L my = £ (gy j*)Jy _ ? £ y 2 
y £« £(gy<5*) £y<5* 

- _ Ho y 2 dx _ iJo dx _ _ 3 

fi»d* f;- 2 d« i 

So the centre of mass is at (f, -^j). 

(ii) Uniform solid 

Suppose that the solid has volume density p and that it is sliced into thin 
discs of thickness dx and radius y. 



Then the mass of the disc shown = m = piny 1 dx) = pny 2 dx. 
Now if the centre of mass is at (x, y) 9 we see by symmetry that # = 0. 

_ £ mx Z (pny 2 dx)x x x y 2 & x 

Also X = —- = -=-21 = ^ ~ 2~c~ 

2 ^m 2- pny ° x Z V Px 

lo xy 2 d-y fo x 5 dx i 5 

Jo y 2 dx i 6 ’ 

So the centre of mass is at (f, 0). 
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Example 4 

Find the centre of mass of: 

(i) a uniform semi-circular wire of radius a ; 

(ii) a uniform semi-circular plate of radius a. 

(i) Semi-circular wire 



Take an axis O* along the line of symmetry and consider a small section 
of wire which subtends angle 39 at the centre. The length of this section is 
a 39 , so if its line density is p it will have mass m = pa 39. 

Also the length of the wire is na, so its total mass will be pna. 

By symmetry, y — 0 

, _ Z {p a 59) a cos 9 a ^ 

and x = —— — - = — > cos o du 

Z m pna n 

=> x = — cos 9 d9 = - [sin 0] = ~ ’ 

TtJ^/2 7t L -^ 2 71 

So the centre of mass of the wire is at (2 <z/7 T, 0), i.e., approximately at 
(0.64a, 0). 

(ii) Semi-circular plate 
Method 1 

Suppose that the plate is divided into strips of length 2 y and width 3x 
and that its surface density is o. 

Then the mass of strip = o(2y 3x) and the total mass of the plate 
1 2 

= jtna a. 
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*77 



Also, by symmetry, y = 0. 


So the centre of mass is at (4 a/Zn , 0); or, approximately, (0.42a, 0). 


Method 2 

This last result can be obtained more easily if we make use of the previous 
result for a semi-circular wire. For the semi-circular plate can be dissected 
into a large number of sectors and each of these sectors is approximately 
a triangle with its centre of mass at a distance 2a/3 from O. 
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In this way we see that the uniform plate of radius a has the same centre 
of mass as a uniform wire of radius 2<z/3. 


Hence 


_ 2(2fl/3) 

x =- 

71 


4 a 
3 n 


Exercise 11.2b 

1 Find the coordinates of the centres of mass of uniform laminae bounded 


by: 

(0 

y = 

* 3 , 

y = 

0 , 

x = 2; 

(ii) 

y = 

x 2 , 

y = 

0 , 

x — 1, 

(Hi ) 

y 2 ~- 

= 4x, 

* = 

= 9; 


(iv) 

y = 

ax 

7’ 

y = 

0 , 

x = h. 


2 Find the coordinates of the centres of mass of the uniform solids formed 
by rotating the areas of no. 1 through 360° about the *-axis. 

3 By rotating a quadrant of the curve y = >/(« 2 — x 2 ) about the x-axis, 
find the position of the centre of mass of a uniform solid hemisphere. 

4 By slicing a uniform tetrahedron parallel to one of its faces, show that 
its centre of mass lies on the line joining the centroid of this face to the 
opposite vertex. Hence show that the centre of mass 

(i) of a uniform tetrahedron lies at the centroid of its four vertices; 

(ii) of a uniform cone (or pyramid) is at one-quarter of its height above 
the base (cf. 2 (iv )) ; 

(Hi ) of a uniform hemisphere shell is half-way from its centre to the middle- 
point of the shell (use the result of no. 3 and part (ii)). 

5 A sphere of radius a is cut into two portions by a plane which is distant 
\a from the centre of the sphere. Show by integration that the volume of 
the smaller of the two portions is 57ia 3 /24. 

Calculate the distance of the centre of gravity of this smaller portion 
from the centre of the sphere. (o.c.) 

6 Use Simpson’s rule to find the approximate value of the ^-coordinate 
of the centroid of the area bounded by the x-axis, the line x = 4, and a 
curve joining the points 

*01 2 3 4 

y 0 1.05 2.21 3.50 4.92. (o.c.) 

11.3 Scalar products 

So far we have confined ourselves to the addition and subtraction of vectors 
and to their multiplication by a scalar. We now move to two other opera¬ 
tions which can be performed on a pair of vectors, and obtain their scalar 
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product and vector product. These are both exceedingly important throughout 
applied mathematics and particularly for describing (respectively) the 
work done by a force and the moment of a force. In this section we shall 
concentrate upon the scalar product. 

If a and b are two vectors with magnitudes a and b whose directions are 
separated by an angle 9 , we define their scalar product as 

a. b = ab cos 9. 



As cos (— 9) = cos 9 we need not be concerned about the direction in 
which to measure 9 , and it is also apparent that 

a. b = a(b cos 9) = a x projection of b on a 
= b(a cos 9 ) = b x projection of a on b. 



As an illustration, if F is a force and c is a displacement of its point of 
application, the scalar product F.c is called the work done by the force in 
this displacement. 
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So the work can be regarded as 

(projection of force F in direction of displacement c) 

x (magnitude of displacement), 

or (projection of displacement c in direction of force F) 

x (magnitude of force); 

and if F and c are perpendicular, then F.c = 0, so that no work is done 
when a force is displaced perpendicular to its line of action. 

A number of results follow immediately from the definition of the scalar 
product: 

(i) a. a = aa cos 0 = a 2 

a. a is usually abbreviated as a 2 . So a 2 = a 2 . 

( ii ) a.b = b .a 

(i.e., scalar product is commutative). 

(■ in ) If a and b are non-zero vectors, 

71 

a.b = 0 <=> ab cos 6 = 0 o 9 = — • 



So a.b = 0 <=> a and b are perpendicular. 

(iv) a .{kb) = a {kb) cos 6 = kab cos 6 
k{a. b) = k{ab cos 6) = kab cos 9. 



b X b 

So a. {kb) = k{a.b) 

i.e., the scalar product is distributive with respect to multiplication by a scalar. 

(This last result, like some of its predecessors and also the next, at first 
appears trivial. But it is worthwhile pausing to appreciate its importance, 
and to be grateful that when we are confronted with a. (26), (2 a) .b and 
2 {a.b), we know without hesitation that they are all equal.) 
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(v) (a + b).c = a.c + b.c , 

i.e., scalar product is distributive with respect to addition. 



This result, too, is clear from a figure. 

For (a b).c = (projection of a + b on c) x c 

= (projection of a on c + projection of 6 on c) x c 
= (projection ofaonc) x f 
+ (projection of b on c) x c 

= a.c T b.c. 

So (a + b).c = a.c + b.c 

and, similarly, a.(b + c) = a.b + a.c. 

This result is particularly important in mechanics. For if P and Q are 
two forces and c is a certain displacement, then 

(P + 2) ,c = T Q.c. 

So, for any displacement, the work done by the resultant of two forces is 
the sum of the amounts of work done by the forces separately. 

Example i 

If two pairs of opposite edges of a tetrahedron are perpendicular, so is the 
third. 


c 
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Take one of the vertices as the origin O, and let the position vectors of 
the others be a, b , c. 


Suppose that 

OB 1 CA 

and 

OC 1 AB. 

Then b . (a 

- c) = 0 

and 

O 

II 

s' 

1 

d 

=> b.a — 

O 

II 

"O 

and 

c.b — c.a = 0 


a.b = b.c 

and 

b * c — ci * c 

=> 

0 

II 




a.(b - c) = 0 

OA 1 BC. 


Finally, we see that in the particular case when the four points are 
coplanar, each is the orthocentre of the triangle formed by the other four. 


Exercise 11.3a 

1 Letting a and b be the position vectors of two points A and B and 
expanding the scalar product (a — b ) 2 , prove the cosine rule for AOAB. 

2 Taking O as the centre of a circle, points with position vectors a, —a at 
opposite ends of a diameter, and r anywhere else on the circle, prove that 
the angle in a semi-circle must be a right angle. 

3 Let O be the middle point of the base BC of AABC and let A, B, C have 
position vectors a, b , —b. By expanding (a — b) 2 + (a + b) 2 , prove 
Apollonius’ theorem, that 

AB 2 + AC 2 = 2 (AO 2 + BO 2 ). 

4 AB, CD are two skew (non-coplanar) lines which have their mid-points 
at P, Q respectively. Prove that 

AC 2 + AD 2 + BC 2 + BD 2 = AB 2 + CD 2 + 4 PQ 2 . 

5 A point P lies in the plane of a triangle ABC, and G is the centroid 
this triangle. Prove that 

PA 2 + PB 2 + PC 2 = 3 PG 2 + i(BC 2 + CA 2 + AB 2 ). 

What is the least value of PA 2 + PB 2 -f PC 2 as P varies in the plane? 
What is the locus of P when P varies so that 

PA 2 + PB 2 + PC 2 is constant? (o.c.) 

Scalar product in component form 

Suppose that a and b are expressed in terms of unit vectors i,j and k : 

a = af T a 2 j -f a 3 k 
and b = bf + b 2 j + b 3 k. 
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We can readily find a. b in terms of these components. 

Firstly we note that i,j, k are unit vectors, so i.i = j.j — k.k = 1; and 
f,j, k are perpendicular, so i.j = i.A; = 0 etc. 

Now a.b = (flji + a^j + a 3 A:).(ii* + b + 6 3 A:) 

—»* a.b — ci^b^ “l - dr^b 2 — f - d^b 

In particular, a 2 = a.a = + 0% + a\ 

and b 2 = b.b = b\ + b\ + b 2 . 

Also, if a = li + mj + nk is a unit vector which makes angles 6, (j), \j/ 
with the three axes, then 

cos 6 = a. i = l 
cos (j) = a.j = m 
cos ij/ = a.k = n, 

so that /, m, n are known as the direction cosines of a. 

These results are frequently of great use in finding the angle between two 
directions in space. 


Example 2 

If three corners of A, B, C of a crystal are known to be at points (3, 2, 1), 
(4, 0, 2), (5, 2, 4) respectively, we may wish to find the angle 6 between 
AB and AC. 

B 



Taking unit vectors t, j y k in the direction of the three axes, 
a — AB = i — 2j + k 
and b = AC = 2 i 4- 3 k. 

Then a 2 = l 2 + 2 2 + l 2 = 6 => a = ^6 
and b 2 = 2 2 + 0 2 + 3 2 - 13 => b = ^13. 

Also a.b = 1x2 + — 2x0+ 1x3 = 5. 

Now a.b = ab cos 6 . 
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So 


FURTHER VECTORS AND MECHANICS 
5 = ^/6 X ^13 COS 6 


COS6 VV13 V 78 8.832 

6 = 55° 31'. 


0.5661 


So AB and AC are inclined at 55° 31'. 


Exercise 11.3b 

1 Find the scalar products of the following pairs of vectors and hence the 
angle between them: 

(0 1 + j and i + 2/; 

(ii) i +j + k and i +j — k; 

(Hi) —i + 2 \j — k and i — j + k. 

2 



In the above rectangular box, OA — 4, OB = 3, OC — 2. Furthermore, 
P is the mid-point of GE; Q divides EG in the ratio 2:1; and R is the centre 
of rectangle BFGD. 

For each of the following pairs of lines, find vectors which represent the 
two lines (in terms oft, j, k) and hence the angle between the lines 
(; i) OP and OD ; 

(ii) AQ and AB ; 

(Hi) ER and RC ; 

(iv) PR and QF. 

3 Find whether the line through (1,0,2) and (4,3, —1) meets the 

line through (0,3, —1) and (5, —2,4). Calculate the (acute) angle 
between their directions. (s.m.p.) 

4 The rectangular cartesian coordinates of two points A and B are (1,2) 

and (—1,4) respectively. Show that the column vector is always normal 

to AB and hence, or otherwise, find the coordinates of the foot of the per¬ 
pendicular from the origin to AB. ( G 0 






II. 4 COORDINATE GEOMETRY OF THREE DIMENSIONS 185 

5 Lines /, m have respectively the parametric equations 



A is the point on l with parameter t x , B the point on m with parameter u x . 
Write down an expression for the vector AB. 

Given that the line AB is perpendicular to both l and ra, find the values of 
t l and u x , and show that the length of AB is 7/^/5 units. (s.m.p.) 


* i i .4 Coordinate geometry of three dimensions 

The equations of a line 



Suppose that P 0 is a point whose position vector is r 0 and that a straight 
line passes through P 0 in the direction of the vector u . Then if P, with 
position vector r, lies on this line, it follows that 

r = r 0 + Au (where A is a scalar). 

Suppose now that the components of 

r are (x,y,z), r 0 are (, x 0 ,y 0 ,z 0 ), and u are (/, m, n). 

Then x = x 0 + XI, y = y 0 + fon, z — z 0 + hn , 

^ x Xq _ y yp _ z Zq 
l m n 

which are called the equations of the line. 

Example 1 

Find the vector and Cartesian equations of the line joining the points 
(1, 2, 3) and (2, 3, 5). 

In this case, r 0 =i+ 2 j + 3A: 

u = i + j + 2k. 

So r = ( i + 2 j + 3 k) + X(i + j + 2k) 

=> r = (1 + A )i + (2 + A )j + (3 + 2A)k, 
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and in Cartesian form: 
x = 1 + A, y = 2 + A, z = 3 + 2A 
x-l_y-2_z-3 
^ 1 “ 1 “ 2 


Example 2 

Find the angle between the lines 



y + 1 
3 


z + 2 
4 


These are in the directions of the vectors 
t + 2/ + 3k and 2 i + 3 j -f 4 k. 

Now, if the angle between these vectors is 0, then 

(t + 2 j + 3k). (2* + 3j + 4*) = Vl4 x ^29 cos 6 
=> 20 = -y/14 x ^/29 cos 6 

'» e = 7i5irv29- 0%99 
=> e = i4° 6'. 


The equation of a plane 



Suppose that P (with position vector r) lies in a plane which passes 
through a fixed point P 0 (whose position vector is'r 0 ), and that m, v are 
two non-parallel vectors in the plane. 

Then r — r 0 + hi + fiv (where A, ju are scalars), which is therefore 
the vector equation of the plane. 

Suppose further that the normal to the plane is in the direction n. 

Then n. (r — r 0 ) = 0 

o • 


n.r = n.r ( 
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If we now suppose that 
n = ai + bj 4 - ck, it follows that 
(at + bj + ck) . (xi + yj + zk) — n.r 0 ; 
and if we let n.r 0 ~ — d y we see that 

ax + by 4- cz + d = 0 , 

which is the Cartesian equation of the plane. 

Example 3 

Find the Cartesian equation of the plane through the point (1, 2, 3) which 
is perpendicular to the line 

x— \_y — 2 _z — 3 
~3~ ~ 2 = ~ 

The direction of this line is given by the vector 3 i T 2 7 + k , and the 
vector from (1, 2, 3) to (x, y , 2 ) is 

(x - l)t + (y - 2 )j + (2 - 3 )k. 

As these must be perpendicular, it follows that 

3(* - 1) + 2{y - 2) + (2 - 3) = 0 
=> 3x + 2y + 2 — 10 = 0. 


Exercise 11.4a 

1 Find, in both vector and Cartesian forms, the equations of: 

(i) the line joining ( 0 , 1 , 2 ) and ( 2 , 1 , 0 ); 

(ii) the line joining ( 0 , 1 , 2 ) and ( — 2 , — 1 , 0 ). 

2 Find the equations of: 

(i) the plane which passes through (1, 2, 3) and is perpendicular to the 
line (x — l )/2 = y + 1 = 2 /3; 

(ii) the plane which passes through ( 0 , 1 , 2 ) and is perpendicular to 
i + 2j + 3fc; 

(Hi) the line through (1, 1,2) which is perpendicular to 2x + y + 2 = 4. 


3 Find the angles between the lines 

(i) r=(l+A)t+(l— h)j 4- (24 - X)k 

and r = (1 — ji)i 4- (1 — 2 fi)j 4- (1 + fi)k; 

x — 1 y — 2 2 y 4* 1 2 4-2 

—2~ = — =3 and - = *-y- =- 


(ii) 


1 


and 


1 
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Angles between planes and lines 

We have already seen how to calculate the angle between two lines in 
space. We now proceed to calculate the angle between two planes; and, 
finally, the angle between a line and a plane. 



The angle between two planes can be defined as the angle between two lines 
in the planes both of which are perpendicular to the line of intersection 
of the planes. Hence it must also be the angle between the normals to the 
two planes. 


Example 4 

Find the angle between the planes 

3x + 2y + 2 = 0 and x + 2y + 4 = 0. 

The directions of the normals to these planes are given by 
3 i + 2 / -f k and i + 2j, 

and the angle between these two vectors is 0 , where 


(3* + 2 7 + k) . {i + 2;) 

COS U — - 7 - 7- - 

Vl4 x V5 

= A— = Jo.7 = 0.836 7 

V?o v 

=> e = 33° 12 '. 


The angle between a line and a plane is defined as the angle between the 
line (/) and its projection (p) on the plane ( 71 ). 
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Hence it is conveniently calculated by finding the angle between the 
line and the normal n to the plane, and then taking its complement. 

Example 4 

Find the angle between the line 

x — \ y 2 _ z — 3 

2 ~~ 3 ~ 4 

and the plane 

4x + 3y + 2z + 1 = 0. 

Now the line is in the direction of the vector 2 i -f 3j + 4 k and the 
normal to the plane is in the direction 4 i + 3 j + 2k. 

So the angle 9 between these directions is given by 

(2t + 3j + 4*).(4t+ 3j+ 2*) 

cos 9 = - 7— - 7— - 

V29 x ^29 
= |f = 0.862 0 
=> 6 = 30° 27'. 

Hence the required angle is 

90° - 30° 27' = 59° 33'. 


The distance from a point to a plane 

Find the perpendicular distance from the point (x l9 y l5 z x ) to the plane 
ax + by + cz + d = 0 . 

Let r x = x x i + yf + z x k 
r = xi + yj + zk 
n = ai + hj + ck. 

Then n.r = ax + by + cz = —d= constant, so that the projection 
of r in the direction of n is a constant. 
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Hence n is normal to the plane. But the unit vector in this direction is 
n/w; so if the required perpendicular distance is p , it follows that the foot 
of the perpendicular from P 2 to the plane is 

n 

r x ± p— (the sign depending on the direction of»). 
n 

But this point lies in the plane. 


So 




+ d — 0 


n.r x + P n + d 


0 


= U = \±Tl ± 4 

n 

Now n.r l = ax x + by x + cz x 
and n = y/ (a 2 + b 2 + c 2 ). 


So 


II \ax 1 + by x + cz x + d\ 

]Pl ~ (a 2 + b 2 + c 2 ) 


In exactly similar fashion in a plane, the perpendicular distance from a 
point P! (*j, y x ) to the line ax + by + c = 0 is given by 


\ax x + by x + c\ 
yj ( a 2 + b 2 ) 


So, for example, the distances of the point (1,2) from the line 
3 # + 4 y + 1 = 0 is 


|3 x 1 + 4 x 2 + 1 
V(3 2 + 4 2 ) 
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Exercise n. 4 b 

i Find pairs of vectors which are normal to the following planes, and 
hence the angles between them: 

(i) x + 2y 4- 3z = 6 and x — y — z = 4; 

(ii) x — y — 3 = 0 and y + z — 1 = 0. 


2 Find the angles between the following planes and lines: 

x — 1 y — 2 z — 3 


(i) 3x + 2y -f z = 1 and 

(ii) x + 2y — z = 3 and 


1 2 3 

x — 2 y — 3 z — 1 
1 = -2 = _ 4 


3 (i) Calculate the perpendicular distances from (2, 1) to the lines (in 

the plane z = 0 ) 

3x + 4# + 5 — 0 and 5x — 12 y + 15 = 0. 

(ii) Find the locus of a point P (*, y) which is equidistant from these two 
lines. Describe this locus. 


4 (i) Calculate the perpendicular distances from (1, 2, 3) to the planes 
x + 2y-\-2z + l = 0 and 2x + y + 2 z — 1 = 0 . 

(ii) Find the locus of a point P (x, y , z) which is equidistant from these two 
planes. Describe this locus. 

5 A cube ABCDA'B'C'D' has base ABCD and vertical sides AA' etc. The 
mid-points of the edges AB, AD, AA' are respectively P, Q, R. The cube 
is cut along the plane PQR. 

(i) Find the angle the edge AA' makes with the plane PQR. 

(ii) Find the angle the plane PQR makes with each face of the cube. 

(Hi) Determine the ratio of the volume of the tetrahedron APQR to that 
of the remainder of the cube. 

If the cube is also cut along the plane AB'C, draw a diagram of the 
triangle PQR and mark in the line of this second cut. (s.m.p.) 

6 Find the perpendicular distance of the point (3, 0, 1) from the line 
whose Cartesian equation is 


x—\_y+2_ z 
3 ” 4 “ 12* 


(m.e.i.) 


(First find the general point P of this line and then the condition for the 
vector from (3, 0, 1) to P to be perpendicular.) 


7 Find the perpendicular (i.e., shortest) distance between the two skew 
lines: 

r = (i - 3 j + 3k) + A(-2• +j - 2k) 
r - (3t - j + 2k) + n(2i - 3 j - 2k). 


8 Find the equation of the plane through A = (2, 2, — 1) perpendicular 
to OA. Which point on this plane is closest to the point (2, —1,2)? 

(s.m.p.) 
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*11.5 Vector products 

If a and b are two vectors, we have seen that we can associate with them a 
scalar quantity, known as their scalar product. We now ask ourselves 
whether there is also a convenient vector product which we could usefully 
consider. 



It is clear that the two vectors a and b fix a particular parallelogram, whose 
area is easily seen to be ab sin 0. Furthermore, the most natural direction 
to associate with this parallelogram would be the one which is perpen¬ 
dicular, or normal, to its plane. As for the particular sense (up or down this 
line), we can easily resolve our dilemma by following the movement of a 
right-handed screw which is turning from a to 6. 



We therefore define the vector product , which we shall writef a X b, as 
the vector 

whose magnitude is ab sin 6 ; 
whose direction is perpendicular to a and b ; 
and whose sense is determined by a right-hand screw turning from a to 6. 

The most common use of vector products occurs in mechanics, when a 
force F acts through a point whose position vector is r and whose moment 
about O is defined as r X F: 

In this figure it is seen that the magnitude of this moment is rF sin 6 = Fp , 
and so is equal to the product of the force and the length of the perpen¬ 
dicular arm from O. The moment itself is a vector of this magnitude which 
acts perpendicular to the plane of r and F. In our example, therefore, by 


f Also denoted by a A b. 
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the right-hand screw rule, this moment acts into the paper and represents 
a turning effect which is in the plane of the paper and clockwise. 

As with the scalar product, the properties of the vector product follow 
from its definition. Some are expected, like 

a X {kb) — {ka) X b = k(a X b) 

and can easily be demonstrated in a figure. Others are more surprising, 
like a X a = 0 (since aa sin 0 = 0) 

and a X b = —b X a 

(because a right-hand screw turning from a to b moves against a right-hand 
screw turning from b to a). So vector products are not commutative. 

Lastly, however, vector products (like scalar products) are distributive 
with respect to addition: 

aX(b + c)=aXb + aXc . 

We shall not establish this until the end of this section, though in passing 
we remark on its importance if P and Q are two forces. 



For the result means that 

rX(P+Q)=rxP+rxQ. 

i.e. that the sum of the moments of two forces is equal to the moment of 
their resultant. 

Vector product in component form 

If a — af + a 2 j + a 3 k 
and b = bf + b 2 j + b 3 k, 

we can easily find the vector product a X b. 
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For, firstly, we notice that 

i X * = j X j — k X k — 0 
and that i X j = k = -j X i 
J X k = i - -k Xj 
k X i = j = ~i X k. 



So a X 6 = (flj* + a 2 j + a 3 A:) X (6i* + b 2 j + ^ 3 A:) 

= (fl 2 ^3 — a 3 ^ 2 )i + ( fl 3 ^i ” a \b 3 )j + (fli ^2 ~ a 2 b x )k. 


Example i 



The position vectors of three points A, B, C are a = i, b = i+j,c=j + k 
Find the angles between: 

(i) the planes OBC and OAC; 

(ii) the line OB and the plane ABC. 

(i) Two vectors in plane OBC are b and c, so the normal to the plane is 
along 


b X c = (i + j) X (j + k) 

=ixj+ixk+jxj+jxk 
= k—j + 0 + i = i—j + k; 


and similarly the normal to OAC is along 

a X c = i X (j + k) = k — j — —j + k. 

Now the angle 6 between two planes is the angle between their normals, so 


cos 6 = 


(* -j + + k ) 


V3 x V 2 


n/6 


= Vf 


e = 35° 16'. 


(ii) Similarly, two vectors in plane ABC 

are AB = b — a = j 

and AC = c — a = —i+j + k. 
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So the normal to the plane is along 
j X (-* + j + k)=i + k. 

Now OB = b = i + j. 

Hence the angle between OB and this normal is </>, where 
(* + j) • (* + k ) i 


cos (p = 


V2 X V2 2 
=> 0 = 60°. 

So the angle between OB and the plane itself is 90° — 60° = 30°. 


Scalar and vector triple products 

We now proceed to ask if there is any way in which we can speak of the 
product of three vectors, a, 6, c. 

Firstly, it is clear that the expression a.(b.c) is meaningless, since b.c 
is not a vector but a scalar and so cannot form a scalar product with a. 
By contrast, however, a. (b X c) does exist and, as it is a scalar quantity, 
is called a scalar triple product. Furthermore, a X {b X c) also exists and 
as it is a vector quantity is called a vector triple product . 

Though these quantities are extremely important in applied mathe¬ 
matics, we cannot investigate them further and simply note the connection 
between scalar triple products and volumes: 



Suppose that a parallelepiped is formed from the three vectors a, b , c, as 
shown in the figure. 

Then b X c is a vector of magnitude be sin 0, which is equal to the 
area A of the base parallelogram, in a direction normal to the base. 

So a, (b X c) is of magnitude (projection of a _L b and c) x A 
= perpendicular height x area of base 
= volume of parallelepiped. 

Hence volume of parallelepiped = a.(b X c) 
and it clearly follows that: 

volume of parallelepiped is zero o a, b, c are coplanar <=> a. (b X c) = 0. 
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Furthermore, in components: 

a = a x i + a^’ + a 3 k 

b X c = (b 2 c 3 — b 3 c 2 )i + (b 3 c x — b x c 3 )j + ( b x c 2 — b 2 c x )k 
= * > u • (b X c) d\ (b 2 c 3 b^c 2 ) "F ®' 2 j\b 3 C\ ^ 1 ^ 3 ) "F ^3 (b x c 2 ^ 2 ^ 1 )* 

So a, b, c are coplanar 

< " > ^1 {b 2 c 3 ^ 3 ^ 2 ) ~F ^ 2 ,(b^o x ^ 1 ^ 3 ) "F # 3 (b x c 2 b 2 c 1 ) ““ $ 

which is a result to which we shall return in chapter 13 . 

Exercise 11.5 

1 If a = i — j\ b = i + j + k, c = k, calculate b X c, c X a, a X b. 

2 Denoting corresponding points in no. 1 by capital letters, find: 

(i) vectors which are normal to the planes OBC, OCA, ABC; 

(ii) the angle between the planes OBC and OCA; 

(tit) the angles between these planes and the plane ABC; 

(iv) the angles between these planes and the line AB. 

3 In no. 1 find whether: 

(0 a. (b X c) — b.(c X a) = c.(a X b); 

(ii) a X (b X c) = (a X b) X c\ 

(Hi) a X (b X c) = (a.c)b — (a.b)c. 

Test these results with other vectors a, 6 , c. 

4 Let the three sides of a triangle ABC have vectors a, 6 , c so that 
a -T b + c — 0. 

Hence prove 

(0 bxc = cxa = axb ; 

(ii) the sine formula for AABC. 

* The distributive law for vector products^ 

aX(b-\-c)=aXb + aXc. 

Firstly, we project b and c on to the plane perpendicular to a, and let 
their projections be b x and c x . 



f Or, more strictly, that the operation of vector multiplication distributes over that of vector 
addition. 
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If the angle between a and b is 9, then a X b has magnitude ab sin 9. 

But a X b i has magnitude ab 1 

— ab cos — 9) 

= ab sin 9. 


Also, as a, b and b l are coplanar, a X b and a X b x both lie along the 
perpendicular to this plane. 


So a X b = a X b x \ 

similarly a X c — a X c l , 

and a X (b + c) = a X (b x + c v ). ^ 


( 1 ) 


We now concentrate on the plane perpendicular to a, of which we can 
take a bird’s eye view: 



Now a X b x has magnitude ab l 
and a X c x has magnitude ac x . 

So the vectors 

a X b i, a X c x , a X {b 1 + cj 

are formed by first multiplying 

&i, c l9 b x + 

by a and then rotating through a right angle. 
But it is clear that 

a X {b 1 + c x ) 

is the diagonal of the parallelogram formed by 
a X b x and a X c 1 . 

So a X (&! + Cj) = a X + a X Cj , 
from which it follows by (1) that 

aX{b + c)=aXb + aXc. 
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*11.6 Plane kinematics 

If a point P is moving in a plane, then we have already seen that (relative 
to a fixed point O) its position vector r, velocity v and acceleration a can 
all be expressed in component form: 

r = xi + yj , 
v = r = xi + yj, 
a = r = xi + ijj. 

These, of course, are based on Cartesian axes Ox, O y, and the question 
now arises whether we can possibly obtain similar results using polar 
coordinates. 

Let us start by supposing that at time t the point P has (relative to a fixed 
point O and a fixed base-line), position vector r and polar coordinates 
(r, 6), where r, 9 are functions of t: 

\ 

I 

// 

/ 

/ 

/ 

/ 

/ 



We now let p and q be unit vectors along and perpendicular to OP; so 
that as P moves,/? and q change direction, rotating with angular velocity 6. 



We immediately see that 
p = ()q and q = — &p. 


But r = rp. 

So r = rp + rp = fp + rOq. 

Furthermore, r = rp + rp + r6q + 

= rp + + r&q + rSq — r6 2 p 

= (r — r0 2 )/? + (r# + 2f0)g. 



Summarising, 


II.6 PLANE KINETICS 


*99 


r = rp 

r = fp + r6q 

r = (r — + (r0 + 2r^)g 


Hence the radial and transverse components of the velocity and acceleration 
of P can be denoted: 




In the particular case when P is moving in a circle of radius a , 
r = a => r = r = 0. 



Hence the components of velocity and acceleration become 
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Furthermore, if P is moving with constant angular velocity 9 it follows 
that 6 = 0 , so that the acceleration of P is again shown to be a& 2 directed 
towards O. 



Example 3 (see also 8 . 1 ) 

In 1611 John Kepler announced that each planet moves so that a line 
from the Sun ‘sweeps out’ area at a constant rate (i.e., so that equal areas 
are swept in equal times). Deduce that the planet’s acceleration must be 
along this line. 


P 



If we suppose that in time St the planet moves from P to P' and the 
additional area swept out is SA, then with the usual notation, 



So if dA/dt is constant, it follows that 

5 ^-" 

=> r 2 d + 2nd = 0 
=> rd + 2 rd = 0 . 
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So the transverse component of the acceleration is zero, and the acceleration 
must be entirely radial. It was this discovery, although not expressed in 
this notation, that halted men’s search for transverse forces propelling the 
planets, and caused Newton to propose forces towards the Sun holding 
them in orbit.) 


Exercise 11.6 

1 The polar coordinates r, 0 of a particle after time t are given by 
r = 0 = 2 1. 

(z) Sketch its path, and state its polar equation. 

(zz) Find the radial and transverse components of velocity and accelera¬ 
tion after time t. 

(Hi) Show that its direction of motion is always inclined at 45° to the 
radius and that its radial acceleration is zero. 


2 



A ship sails away from a lighthouse keeping the light on its port quarter at 
a constant angle a. 

r 6 d 6 tan a 

Show that: (0 — = tan a; (zz) — =-; 

w r dr r 

(zzz) if r = a when 0 = 0, then r = a e ecot * . Sketch this path. 

3 

o 



A bead is placed on a straight smooth horizontal wire which is rotating 
about its end O with constant angular velocity co, so that 0 = cot. As there 
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is no radial force on the bead, its radial acceleration must be zero. Hence 
show that 

( i) f - co 2 r = 0, and r = A e wt + B e'™ 1 ; 

(ii) if the bead is initially at rest at a distance a from O (i.e., t — 0, r = a 
and f — 0), then r = a cosh cot ; 

(Hi) f — O0yJ(r 2 — a 2 ). 

4 Describe the curve given by 

/5 cos (t 2 )\ 

r = I ■ / b where t denotes time. 

\5 sin (r )J 

Find a( = r) and deduce the radial and transverse components of the 
acceleration. 

A boy slides down a banister such that his path is given by 

1 5 cos (tf 2 )\ 
r = 1 5 sin (/ 2 ) J , 

\ kt 2 J 

where the units are metres and k is a constant. Indicate on a diagram the 
shape of the banister and the magnitudes of the boy’s acceleration com¬ 
ponents in the directions (a) towards the axis of the curve, ( b) vertically 
downwards, ( c) in a direction perpendicular to those in (a) and ( b ). 

(Note. The unit vectors for r are horizontal south, horizontal west, and 
vertically downwards.) (s.m.p.) 

5 A bead is threaded on a smooth wire in the shape of a cycloid 
* — a(0 — sin 0 ), y = a( 1 + cos 0 ), which is fixed in a vertical plane with 
the positive y-axis as the upward vertical. The bead is released from the 
position given by 9 = 0. Prove that, in the subsequent motion 

(i) dO/dtis constant; 

(ii) the acceleration vector has constant magnitude. (o.c.) 


11.7 Impulse and momentum 

Suppose that a particle of mass m is pulled in a horizontal line by a constant 
force of magnitude F , and that in an interval of time t its speed changes 
from v 0 to v 1 with an acceleration of magnitude a. 

Since the acceleration is uniform, 


v i — v o = a t> 




velocity 


v 0 
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Also, by Newton’s second law, 


F = ma. 

Hence Ft = mv 1 — mv 0 . 



The expression Ft is called the impulse of F during the interval t\ and if, 
further, the product of mass and velocity is called the momentum , we see 
that mv x — mv 0 is the increase in momentum. 


So 


Impulse of force during interval = Change of momentum 


In SI, the unit of impulse Ft is 1 N s (and, since 1 N = 1 kg m s 2 , is 
the same as 1 kgms" 1 , which is also the unit of momentum, mv). 


Example i 

A stone of mass 2 kg is thrown vertically downwards from a high cliff with 
velocity 4ms" 1 . Find the impulse of its weight in the first two seconds of 
its fall and verify that this is equal to its increase in momentum. 

Taking g = 9.8 m s -2 , the stone’s weight is 19.6 N and the impulse of 
this weight is 19.6 x 2 = 39.2 N s. 

Also, its velocity after 2 s = 4 + 9.8 x 2 = 23.6 ms \ 

So initial momentum = 2 x 4 = 8 N s, 

final momentum = 2 x 23.6 = 47.2 N s, 
and increase of momentum = 39.2 N s. 


Impulse of a variable force 

It frequently happens when a sudden blow is struck — for instance when 
a ball is hit by a cricket bat — that a very large force acts for a very small 
interval of time. In such cases it is often either impossible or unnecessary to 
investigate either the magnitude of the force or the duration of its action, 
and attention is confined to the impulse of the force and the consequent 
change of momentum. Moreover, the force will usually vary with time: the 
thrust of the bat on ball increases very rapidly from zero (when contact is 
first made at time t 0 ) to a maximum value, and then decreases just as 
rapidly to zero (when the ball finally leaves the bat at time t x ). 






In such cases the total impulse during the interval will be the limit of the 
sum of small impulses F <5 1. 


rt i 

So impulse = lim ]T F dt = F dt. 

Jto 


Moreover, at any instant 





dt — \mv~\ 


Vi 
Vo * 


So, again, 

Impulse of force during interval = Change of momentum 


Exercise 11.7a 

1 Calculate the momentum (in N s) of: 

(i) a car of mass 1 tonne travelling at 30 m s“ 1 ; 

(ii) a lorry of mass 10 tonne travelling at 10 m s~ 1 ; 

(Hi) a pellet of mass 5 g moving at 150 m s~ 1 ; 

(iv) an electron of mass 9 x 10“ 31 kg moving at 3 x 10 7 ms -1 . 

2 Calculate (in SI units) the impulse of a force: 

(i) 100 N acting for 3 s; 

{ii) 6 1 N, from t — 0 to t = 2; 

(Hi) 4 1 3 4 N, from t = 1 to t = 3; 

{iv) 2 sin t N, from t = 0 to t = 

3 A car of mass 1 tonne moves from rest under the action of a tractive 
force and a resistance whose difference (the effective force ) has a constant 
value of 400 N. Find: 

(i) the impulse of the effective force in the first 5 s; 

(ii) the resulting momentum; 

(Hi) the resulting velocity. 

4 Repeat no. 3 if the effective force diminishes steadily from 400 N to zero. 
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5 Repeat no. 3 if the effective force after time t is 16(5 — t) 2 N. 

6 A lorry of mass 10 tonne is travelling at 30 m s ” 1 when its brakes are 
applied for a period of 2 s. If the braking force has the constant value 
20 000 N, find: 

(i) its impulse; 

(«) the final momentum; 

(zzz) the final velocity. 

7 Repeat no. 6 if the braking force increases steadily to 20 000 N in the 
2 s interval. 

8 Repeat no. 6 if the braking force increases steadily from zero to 20 000 N 
in one second and then decreases steadily to zero after another second. 

9 A ball of mass 1 kg strikes the cushion of a billiard table perpendicularly 
at a speed of 3 ms " 1 and rebounds at 1 ms -1 . What is its change of 
momentum? If the total time of impact was 0.02 s and the force of the 
cushion is supposed constant, find the value of this force. 

10 Repeat no. 9 on the different supposition that the force of the cushion 
increases steadily from zero and then decreases steadily to zero (all within 
0.02 s). What is the maximum force? 

11 A jet of water of radius 1 cm and speed 4ms -1 plays directly on to a 
steel plate which completely destroys its momentum. Find: 

(z) the loss of momentum of the water which strikes the plate in 1 s; 

(zz) the force of the water on the plate. 

12 A machine gun fires bullets of mass 30 g with muzzle velocity 500 ms -1 
and at a rate of 200 per minute. What average force must the gunner exert 
to prevent recoil? 

13 At a waterfall, a stream (which has no vertical velocity at the top of 

the waterfall) falls 5 m vertically on to a horizontal rock without rebound¬ 
ing. Given that 150 dm 3 of water pass over the waterfall each second, cal¬ 
culate the force exerted on the rock. (s.m.p.) 

14 A horizontal jet of water delivering 5 kg s - 1 at a speed of 10 m s -1 

hits a vertical wall. Assuming that the water does not rebound, calculate 
the force on the wall. If allowance were made for rebound, would this force 
be increased or decreased? Give a reason for your answer. (c.) 

15 A train of mass 1 000 tonne started from rest and the effective hori¬ 
zontal force was recorded at intervals of ten seconds: 


t/s 

0 

10 

20 

30 

40 

50 

60 

F/ 10 5 N 

1.62 

1.45 

1.36 

1.24 

1.07 

0.91 

0.73 


Find the total impulse of this force and hence the final momentum and 
velocity of the train. (The use of Simpson’s rule is recommended.) 
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Impulse and momentum as vectors 

So far we have confined ourselves to the study of a body moving in a straight 
line. We can now consider the more general case of a body of mass m moving 
under the action of a variable force F. 



Suppose that after time t the particle has velocity v and acceleration a ; 
and that when 

t = t 0 , v = Vo ; 
t = t x , V = v l . 

By Newton’s second law, 

F dV 

r = ma = m —; 


so that we naturally write 



This is the first time that we have dared to speak of the integral of a 
vector quantity, but success with the differentiation of vectors is a strong 
incentive for such a step; and in any case, the expression 



can be regarded as shorthand for the vector whose components are 

f'™, p rdi, pzd< 

Jfo J*0 J* 0 

(where X, Y, Z are the components of F). 

We therefore see that the impulse of a force during an interval of time, 
like the momentum mv, is essentially a vector quantity; and, again, that 

Impulse of force during interval = Change of momentum 


The 2 kg mass of Example i is now thrown horizontally with a speed of 
4ms" 1 . Investigate the impulse of its weight and the change of its momen¬ 
tum in the first two seconds of its flight. 
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Taking unit vectors i horizontally and / vertically downwards, we see 
that 

weight = 2 x 9.8j = 19.6/, 

=> impulse = 19.6/ x 2 = 39.2/. 

Also initial velocity = 4* 

and final velocity = 4* + 9.8 x 2/ 

= 4 i + 19.6/. 

So initial momentum = 4t x 2 = 8 i 
and final momentum = (4 i + 19.6/) x 2 
= Si + 39.2/. 

Hence change in momentum = 39.2/ and we confirm that this is equal 
to the impulse of the weight. This can also be summarised diagram- 
matically: 


v 0 




Impulse = / = mv t — mv 0 


Example 3 

Water flowing in a horizontal pipe of radius 2 cm at a speed of 3 m s “ 1 first 
goes round a sharp right-angled bend and then issues as a jet perpendicular 
to a fixed metal plate which destroys its momentum. Calculate: 

(i) its thrust on the bend; 

(ii) its thrust on the plate. 

In 1 s the stretch of water coming out of the pipe has length 3 m and 
volume 

n x (0.02) 2 x 3 = 12 tt x 1(T 4 m 3 . 

Now 1 m 3 of water has mass 10 3 kg. So the mass of this water is 1.271 kg, 
and its momentum is 

1.271 x 3 = 3.671 N s. 

Now after going round the bend its momentum changes direction 
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through a right angle, and such a change in momentum must be caused by 
an impulse / such that 



Hence the impulse / acts into the bend, at 45° to the two sections of pipe, 
and has magnitude 

3.671 x « 16 Ns. 

But this is the impulse, over an interval of 1 s, of a force of 16 N. 

The thrust of the water on the pipe, therefore, is of magnitude 16 N 
outwards from the bend at 135° with both sections. 

When the jet strikes the metal plate, the momentum 3.671 N s is reduced 
to zero and this therefore necessitates an impulse of 3.67c N s which is 
caused by a thrust of 3.67c N acting for 1 second. So the thrust of the plate 
on the jet, and hence of the jet on the plate, has magnitude 3.67c « 11.4 N. 

Exercise 11.7b 

1 Find (in SI) the impulse of: 

(i) a force 6 i + 27 + k, acting for 2 s; 

(ii) a force —i— 3 j + 2k, acting for 3 s; 

(in) a force ti + (1 — t\j + 3 t 2 k, from t — 0 to 2; 

( iv ) a force sin ti + tj , from t = 0 to 

2 Find the final velocity of a mass of 2 kg which has initial velocity 
4 i + 3; + 2k and is acted upon (on separate occasions) by the forces 
described in no. 1. 

3 A body of mass 4 kg has initial velocity 4 i + 2 j — k . What constant 
force must act upon it so as to produce: 

(i) velocity 3 i + 2 j + 4A; after 2 s; 

(ii) velocity i + k after 5 s. 

4 A sudden downpour of r cm of rain, of density p kg m -3 , falls in one 
hour on an area of A km 2 and p per cent of this water runs straight off the 
ground into a river b m wide. What is the likely rise in level of the water 
in the river if this flows at v m s _ 1 . 

A containing bank turns the river-flow through a right angle. What is the 
extra force on this bank due to the downpour? (m.e.i.) 
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Conservation of momentum 

So far we have limited ourselves to the consideration of an impulse upon a 
single body and its consequent change of momentum. But it is when a 
number of bodies are connected or interacting that the concept of momen¬ 
tum becomes most valuable. 

Suppose that A and B are two particles and that the only forces acting 
upon them are their mutual reactions R and — R which can (as when two 
billiard balls clash) be changing both in magnitude and direction. 

R 


- R 



Over any interval of time, 


Jto 


R dt = change in momentum of A, 


m t 1 

— R dt = change in momentum of B. 
Jto 


But 


m ti rti 

R d£ -f — R dt 

Jto Jto 


ft I 

0 d* = 0. 

Jto 


So the total change in momentum is zero, and we have verified the 
principle of conservation of momentum for a system which has no external 
forces acting upon it. 


Example 4 

A mass of 2 kg moving with velocity i — 2 j + 3 k coalesces with a mass of 
8 kg which is moving with velocity — i -f 3j 6k. Find the velocity of 
the combined mass of 10 kg. 

If we let this final velocity be v, we can apply the principle of conser¬ 
vation of energy and obtain: 

10t; = 2 (i — 2 j + 3A;) + 8( — i + 3 j + 6 A;) 

= — 6z* + 20 j + 54 k 
=> v = — 0.6i + 2 j + 5.4A:. 

Example 5 

A shell of mass m is fired with velocity v at an angle a to the horizontal. 
If the gun has mass M and recoils horizontally on smooth rails, find its 
speed of recoil, V. 
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Momenta 



The forces acting for the short interval of the firing are: 

(i) the weights Mg and mg, whose impulses over a very short interval are 
negligible; 

(ii) the very large explosive force between gun and shell which has equal 
and opposite impulses / and — / on the gun and the shell; 

(iii) the normal reaction between the rails and the gun which exerts a 
vertical impulse J on the gun. 

So the only external impulse is J, which has no horizontal component. 
Hence the horizontal component of momentum must be conserved. 

=> mv cos a — MV — 0 

mv cos a 

^ = M 


Exercise 11.7c 


1 Find the velocity of the combined mass formed by the coalescing of: 
(i) 1 kg moving at 4i + 3 j — k, 
and 2 kg moving at i + 37 + 2k ; 


(ii) 3 kg moving at i + j + k, 

2 kg moving at 2 i + 3 j — 3k, 
1 kg moving at 3 i + 2 j — 3k. 


2 A mass of 2 kg moving with velocity ms 1 collides and combines 


with a mass of 3 kg moving with velocity 

Calculate the velocity of the single mass so formed. 

Find also the impulse which each mass has received. (s.m.p.) 


1 

-2 


m s 


3 A rocket of mass 10 kg is moving with a velocity whose three components 
(inms" 1 ) are (200, 160, 4) and then separates into two parts. If the rear 
portion has mass 8 kg and velocity (100, 100, 0), find the new velocity of 
its front portion. 

4 A bullet of mass 0.05 kg moving horizontally with velocity 321 ms -1 
strikes a stationary block of mass 16 kg which is free to slide without rotation 
on a smooth horizontal plane. The bullet becomes embedded in the block 
after 0.01 s. Calculate, in newtons, the resistance, assumed uniform, of 
the block to penetration by the bullet. 
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5 A rocket of mass 40 tonnes (= 4 x 10 4 kg) is mounted rigidly on a 
trolley of mass 10 tonne which is free to move horizontally. The rocket 
ejects mass 1 tonne horizontally in a burst lasting 5 seconds, giving the 
ejected matter a speed which may be taken as 2 km s“ 1 relative to the 
ground. Calculate the velocity of the rocket just after the ejection (i) neg¬ 
lecting resistances, and (ii) assuming resistance of 4 x 10 4 N. 

If the same operation were carried out with the rocket mounted vertically 
and free to lift off its mounting show that, for a lift-off to occur, the duration 
of the burst (assumed uniform) must not exceed a certain critical time, and 
calculate this time. (The question of resistance does not arise in this cal¬ 
culation. Explain why.) (Take g = 10 ms -2 .) (m.e.i.) 

6 Two stones, each of mass 5m, are moving across a sheet of smooth ice at 
equal speeds of 10 i> in opposite directions on parallel paths, so that no 
collision is involved. A frog of mass m, travelling on one of the stones, leaps 
across to the other one, and in so doing deflects the stone he leaves through 
30° and changes its speed to 8 i>. 

Find, by drawing and measurement or by calculation, (i) through what 
angle the other stone is deflected and its subsequent speed; (ii) the (vector) 
impulse the frog exerts on the stone on which he lands. (s.m.p.) 

Newton’s law of impact 

Suppose that two bodies of mass m x , m 2 collide and that the components 
of their velocities along the line of impact (i.e., along the common normal 
at the point of contact) 

before the impact are u x , u 2 ; *-► *-► 

« 2 

and after the impact are v x , v 2 ■ •-— -► ---► 

Then Newton’s experimental law of impact states that 
v x — v 2 = — e(u x — u 2 ), 

where e is a constant for the pair of bodies known as their resilience , or 
coefficient of restitution. 

Alternatively, this can be expressed as 

v 2 ~ v \ — e(u x — u 2 ), 

that their relative velocity of separation is proportional to their relative 
velocity of approach. 

If the second body is at rest, for instance when it is a fixed hard floor on 
to which the first body is dropped, it follows that v x = — eu x \ and it is 
quickly seen that e can range from 0 (in the case of a piece of putty dropped 
on to the floor) to nearly 1 (for a ball with a very good bounce, which is 
said to be almost perfectly elastic ). 


So 0 ^ e ^ 1 . 
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Example 6 

A ball is dropped from a height of 5 m on to a hard horizontal pavement. 
If e — calculate: 

(i ) the height to which it returns after its first bounce; 

(ii) the total distance travelled after repeated bounces; 

(Hi) the total time taken. 

(Take g = 10 m s -2 .) 

(/) After falling from a height of 5 m the speed of the ball is 
yj (2 x 10 x 5 ) = 10 ms" 1 ; 

and since e = it rebounds with a speed of^x 10 = 5ms” 1 . 

Hence it will rebound to a height h , where 

5 2 = 2 x 10 x A => h = 1.25 m; 

(ii ) Just as the height of the first rebound is \ x 5 m, so, similarly, the height 
of its second rebound is 

i x ({ x 5) = (i ) 2 x 5 m 

and the height of its third rebound is 

i x (\) 2 x 5 = (i ) 3 x 5 m, etc. 

Hence its total distance travelled (up and down) in making an infinite 
number of bounces 

= 5 + 2{i x 5 + (i ) 2 x 5 + (i ) 3 x 5 + • • •} 

= 5 + 2 x { x 5{1 + { + (4 ) 2 + • • •} 

= 5 + f —= 5+ fx| = 8 fm. 

^ 4 

(Hi) The time taken to drop from 5 m = Is; 

the time taken between first and second bounces = 2 x To ~ Is; 

5 

the time taken between second and third bounces = 2 x — = ^ s ; etc - 

Hence total time taken = 1 + (1 + ^ + ^+ -**) 

= 1 + 2 = 3 s. 

So the ball comes to rest (after an infinite number of bounces) after 3 s and 
travelling a total distance 8 ^ m. 

Example 7 

A ball of mass 2 kg is moving with velocity 3ms -1 and collides directly 
with another of mass 1 kg which is moving in the opposite direction with 
velocity 5ms" 1 . Iff? = f, find their velocities after impact. 



II.7 IMPULSE AND MOMENTUM 


213 


2 1 

Before impact • ^ ^ ♦ 

3 5 


2 

After impact • . ^ 

v ] 


1 


v 2 


Let the final velocities be v x , v 2 respectively. Then as there is no external 
force acting on the system, momentum is conserved: 

-f 1^2 = 2 x 3 f 1 x —5 
=> 2v x + v 2 = 1 . 

Also, by Newton’s law of impact, 

2 — = | x (5 + 3) 

2 -^= 6 . 

2v 1 + v 2 — 1 

— v { + v 2 = 6 

„ - _ 13 

t/j — 3 , U 2 — 3 . 

Hence the final velocities of the two balls are — if m s - 1 and m s~ L 

Exercise n.7d 

1 A ball of mass 4 kg moving at 3 m s “ 1 catches up with another ball of 
mass 2 kg moving at 1 ms ' 1 in the same straight line. If e = f, find their 
velocities after impact. 

2 A ball of mass 5 kg moving at 4 m s” 1 catches up with another ball of 
mass 2 kg moving at 3ms ' 1 in the same straight line. If the second ball 
has velocity 4ms -1 after impact find the coefficient of restitution and the 
final velocity of the first ball. 

3 A ball of mass 4 kg moving with speed 2ms ' 1 meets a ball of mass 2 kg 
moving in the opposite direction with speed 3ms' 1 . Ife = -^, find the 
final velocities of the two balls. 

4 A ball of mass 3 kg moving at 5 ms -1 meets another ball of mass 6 kg 
moving in the opposite direction at 3 ms -1 . If the impact brings the 
second ball to rest, find the final velocity of the first ball and the coefficient 
of restitution. 

5 A smooth sphere A of mass 1 kg moves with speed 5ms " 1 on a smooth 
horizontal plane directly towards a vertical wall. Before it hits the wall it is 
struck from behind by a smooth sphere B with the same radius as A but of 
mass 2 kg moving towards the wall along the same line as A with speed 
10ms -1 . Given that the coefficient of restitution between the spheres is 
find their new velocities. 


v 

=> V 

So 

and 
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The sphere A later strikes the wall and rebounds, the coefficient of 
restitution being It then collides again with B which has been following 
it towards the wall. Show that, after this collision, both A and B are again 
moving towards the wall, the speed of A being three times that of B. 

(m.e.i.) 

6 Three identical imperfectly elastic smooth spheres A, B, G are at rest 
in a straight line, but not in contact, on a smooth horizontal plane in the 
order named. A is given a velocity towards B. Prove that there will be at 
least three collisions during the subsequent motion. 

If the coefficient of restitution between each pair of spheres is deter¬ 
mine whether there is a fourth collision. (l.) 

7 The centres of three equal spheres A, B, C, at rest on a smooth hori¬ 
zontal table, lie on a straight line. The coefficient of restitution between A 
and B is e and between B and C is e’ . The sphere A is projected to strike B 
directly and B then strikes C. Show that there are no further collisions if 

*' < (3* - 1)/(1 + e). (w.) 

8 A c Supaball ’ when dropped on a hard floor displays a coefficient of 

restitution, e, of very nearly unity, and e may be assumed to be independent 
of the velocity of approach. When the ball is dropped vertically from a 
height of 2 m on to a hard floor it comes finally to rest 26.0 s after being 
released. What is e? What is the total distance that the ball has travelled? 
[g = 9.8 m s' 2 ]. (c.s.) 

11.8 Work, energy, and power 

Movement in a straight line 

Suppose that a mass m is pulled along a smooth horizontal table by a 
constant force of magnitude F and that after moving a distance s its velocity 
has increased from v 0 to v x . 


Vo . 



» 

a 


If the magnitude of the constant acceleration of m is a, 
then v\ = Vq + 2 as. 

But, by Newton’s second law, 

F = ma 

=> Fs = mas = \mv\ — \mv %. 

The expression Fs is called the work done by F in the displacement s ; and if, 
further, the expression \mv 2 is called the kinetic energy of a mass m moving 
with velocity v , we see that \mv\ — \mv\ is its increase in kinetic energy. 
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So 


Work done by the force 

in the displacement = Increase of kinetic energy 


In SI, the unit of work (and therefore of energy) will be 1 N m, which is 
called 1 joule ( 1 J). 


Example 1 

A car of mass 800 kg on a horizontal road accelerates from a speed of 
10 ms 1 under the action of a tractive force of 500 N and against a 
resistance of 100 N. Find its speed when it has travelled a distance of 20 m. 

Work done by tractive force = 500 x 20 = 10 000 J. 

Work done by resistance = — 100 x 20 = — 2 000 J. 

Hence total work = 8 000 J. 

If the final speed = v , the gain in kinetic energy 

= i x 800z; 2 - { x 800 x 10 2 
= 400 (z ; 2 - 100). 

So 8 000 = 400 (v 2 - 100 ) 

=> v 2 — 100 = 20 

=> v 2 = 120 

=> v = 10.95. 

So final speed is 10.95 m s -1 . 


Exercise 11.8a 

1 A man pushes a chest of mass 20 kg through a distance of 6 m across 
a horizontal floor by means of a horizontal force of 50 N. 

Find: 

(i) the work done by this force; 

(ii ) the final velocity of the chest if the floor is smooth; 

(in) the final velocity if the coefficient of friction is 0 . 1 . 
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2 Ignoring all resistances, what mean propulsive force is required to 
accelerate an aircraft of mass 10 tonne from rest to a take-off speed of 
40 ms " 1 on a runway of length 1 km? What force would be required if 
the take-off speed had to be doubled in the same length of runway? 

3 A bullet of mass 20 g is travelling at a speed of 400 ms -1 . If it pene¬ 
trates a block of wood to a depth of 5 cm, find the average resistance. 

4 A stone of mass 2 kg is skidding across a sheet of ice at a speed of 10 m s” 1 . 
If it comes to rest in 80 m, find: 

(i) the work done by friction; 

(ii) the coefficient of friction. 

5 A lorry of mass 8 tonne needs to brake from a speed of 10 m s“ 1 in a 
distance of 80 m. 

Find: 

(i) the work done by the braking force; 

(ii) the average braking force; 

(Hi) the average braking force required if this distance were to be halved; 
(iv) the average braking force required if the initial speed were to be 
doubled. 

6 Express a vehicle’s braking distance d in terms of its speed v and maxi¬ 
mum braking force F. Supposing that F is constant, sketch a graph showing 
d as a function of v. 

7 A mass of 2 kg moving at 3ms " 1 catches up with a mass of 3 kg 
moving at 2 m s -1 in the same direction. Supposing that the bodies are 
perfectly inelastic (e = 0 ), find: 

(i ) their velocities after impact; 

(ii) their total loss of kinetic energy. 

8 Repeat no. 7 if the bodies are elastic, with e — 

9 Repeat no. 7 if the bodies are perfectly elastic, with e = 1. 

10 A nail of mass 10 g is driven 1 cm into a wall by a hammer of mass 
100 g which strikes it with a speed of 2 m s 1 . Calculate the average 
resistance of the wall on the assumption: 

(i ) that there is no resilience between hammer and nail (e = 0 ); 

(ii) that there is perfect resilience (e = 1 ). 


Variable forces 

When F is a variable force, the total work done in a displacement 5 will 
be the limit of the sum of small amounts of work F x 8s and is represented 
by the area under the F, 5 graph: 
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Elastic forces: Hooke's law 

One of the commonest instances of a variable force is provided by a wire 
spring or an elastic string. It was first shown by Robert Hooke (1635-1703) 
that if a spring is extended from its natural length but remains within its 
elastic limits, the tension T is proportional to the extension x : 

T = kx, 

where A; is a constant of the particular spring, known as its stiffness: f 

(If, in the case of a spring, * is negative, then so is T : the spring is under 
compression and is exerting a negative tension, or thrust.) 

When such a spring is extended, we see that T and x are in opposite 
directions, so the work done by the tension T = —kx during this extension is 

— kxdx = —{fkx 2 ^ = — \kx 2 , 

Jo 

whose magnitude can also be found from the force, distance graph: 



t It the natural length of the spring (or string) is /, then k is sometimes written as X/l, where X 
is called its modulus of elasticity. Hence T = Xxjl, and the corresponding elastic energy is 
Xx 2 /2L 
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Potential energy 

Finally, we note that if our loaded elastic spring with extension x were to 
return to its natural length, the tension would clearly do a positive amount 
of work \kx 2 . This, therefore, is called the elastic potential energy of the spring, 
being its stored capacity for doing work. 

Similarly, if a mass m is at a height h above a given base-level, its weight 
mg would do an amount of work mgh if it were to return to its original 
level. This, therefore, is called its gravitational potential energy (referred to the 
given base-level) when at height h. 

We therefore see that, for any displacement, the loss in potential energy 
(p.e.) is equal to the work done by the weight and so is equal to the gain in 
kinetic energy (k.e.): 

loss in p.e. = gain in k.e. 


Example 2 

An elastic spring of unstretched length 1 m hangs from the ceiling of a 
room whose height is 3 m. When a 1 kg mass is attached, the spring 
extends to a length 2 m, and it is then pulled down further, so that the mass 
touches the floor. If it is then released, find its velocity at heights 1 m, 
2 m above the floor, and calculate the kinetic energy, gravitational poten¬ 
tial energy, and elastic potential energy in all three positions. 

Let the stiffness of the spring be k. Then, taking g = 10 m s" 2 , the 
weight of the mass will be 10 N 

and k x 1 = 10 => k = 10. 


If we measure depth x from the unstretched position, we can now 
investigate: 

(i) The displacement from x — 2 to x = 1. 

Work done by weight — — 10 x 1 = — 10 J. 

Also tension in spring = lOx. 


So work done by spring 


ing ■ I 


lOx dx 


= O 2 ] i = 15 J 


A T 


x 



Furthermore, if velocity is v when x = 1, gain in k.e. — | x 1 x v 2 . 

So -10 + 15 = \v 2 
=> v 2 = 10 

=> » = ^10 = 3.2 ms" 1 . 
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(ii) The displacement from x = 2 to x = 0 
Work done by weight = — 10 x 2 = — 20 J. 
f° 

Work done by springs — 10* dx 

= lV]o = 20 J. 

So if velocity is V when x = 1, gain in k.e. = \ x 1 x V 2 = | V 2 . 

So -20 + 20 = ±V 2 
=> V = 0. 

Finally, if we measure gravitational p.e. from floor level and elastic p.e. 
from the unstretched position (so that elastic p.e. = \ x kx 2 = 5x 2 ), we 
can express the three forms of energy in the following table: 

Gravitational Elastic 

Depth from ceiling/ m p.e./ J p.e./ J k.e./ J 

1 (x = 0) 20 0 0 

2 (* = 1) 10 5 5 

3 (* = 2) 0 20 0 

So we note that at each level the sum of the three forms of energy is 20 J. 
Hence, in both our examples, the total energy of each system remains 
constant. More generally, this is true in any system, provided that the work 
done by its forces to achieve any particular position is independent of the route by which 
this position is achieved. 

The general result is then known as the principle of conservation of energy. 
It is, however, necessary to issue a warning that not all systems are as 
well-behaved (or ‘ conservative ’) as these. If, for instance, we consider a mass 
being moved across a rough table from a fixed starting point to a particular 
position, it is clear that the work done by friction is entirely dependent on 
the route taken. So friction cannot have a potential and is a non-conservative 
force, so that it is impossible to have a principle of conservation of mechani¬ 
cal energy in this case. We shall meet many similar instances, such as the 
impact of particles which are not perfectly elastic, when the total mechani¬ 
cal energy of the system is seen to diminish, though this is accompanied by 
the production of other forms of energy, such as heat or sound which con¬ 
serve the overall energy of the system. 

Exercise n.8b 

i If a spring has stiffness 200 Nm" 1 , find: 

(i) the tension needed to extend it by 4 cm; 

(ii) the force required to compress it by 5 mm; 

(in) its extension when pulled out by a force of 40 N; 

(iv) its compression when pushed in by a force of 10 N. 
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2 In each case of no. i, find the elastic energy stored in the spring. 

3 An elastic string is extended 2 mm by a tension of 4 N. Find: 

(z) its stiffness; 

(ii) its elastic energy. 

4 A spring is compressed a distance of 1 cm by a force of 500 N. Find: 

(z) its stiffness; 

(ii) its elastic energy. 

5 An elastic string has natural length 2 m and stiffness 50 N m -1 . Find: 
(z) how far it stretches when a mass 20 kg hangs from it; 

(ii) its elastic energy in this position; 

(Hi ) the work done by its weight if the 20 kg was attached to the end of the 
string when unstretched and gently lowered into its equilibrium position. 
(Takeg « 10 m s -2 .) 

6 The particle of mass 20 kg in no. 5 is attached to the string and dropped 
vertically from its point of suspension. When the string has become taut 
and is extended by a further amount x, find: 

(z) the loss in p.e. of the weight; 

(ii) the elastic energy of the string; 

(Hi) the k.e. of the particle. 

Hence find the maximum extension of the string. (Takeg « 10 m s -2 .) 

7 A nylon climbing rope of length 40 m is stretched by a force F. The 
relation between the force F and the corresponding extension x is given in 
the following table: 

x/m 01 2 3456 7 89 10 


F /N 0 1 600 2 400 3 000 3 600 4 100 4 600 5 200 5 700 6 400 7 900 

Use Simpson’s rule to estimate the energy stored in the rope when it is 
extended by 10 m. 

A climber of mass 64 kg is climbing on a vertical rock-face h m above 
the point to which he is securely attached by the rope of length 40 m 
described above. The climber slips and falls freely, being brought to rest 
for the first time by the rope when it has been extended by 10 m. 

Calculate the value of h to the nearest metre. 

Determine also the velocity of the climber at the moment the rope 
becomes taut, and hence, or otherwise, calculate the average deceleration 
of the climber from that moment until he comes to rest. (Takeg = 9.8ms -2 .) 

(J.M.B.) 

8 If the radius of the Earth is R , the gravitational attraction on a body of 
mass 1 kg at a distance x from the Earth’s centre is 

g RZ -f ^ n S X ^ n 

—if x ^ R ; — if x < R. 

x z R 
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Hence find the work done by this force (or weight ): 

(/) if the body came to the Earth’s surface from a very great distance; 

(ii) if the body could move from the Earth’s surface in to its centre. 

9 From no. 8 find: 

(■ i ) the minimum speed with which a body from outer space would 
approach the Earth; 

(ii) the speed with which a body would reach the centre of the Earth if it 
could be dropped from its surface down a vertical shaft. (Take g « 10ms -2 , 
R = 6.4 x 10 6 m.) 


Work as a scalar product 

So far, we have restricted ourselves to the work done by a force when its 
displacement is in its own direction, but we are now able to examine the 
more general case. 

If a force F undergoes a displacement s, the work done is defined as the 
scalar product F.s. We therefore see that work is a scalar of magnitude 
Fs cos 9 (where 9 is the angle between F and s) and so can be regarded as 

F x component of s along F 



or 5 x component of F along s. 



(ii) when 9 — n, work = —Fs; 
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(Hi) when 9 = \n , work — 0 . 

A s 

"- F 


Hence the work done in any displacement perpendicular to the force is 
always zero. In particular, the work done by a normal reaction at a smooth 
surface is always zero. 

Moreover, we have already seen (in 9 . 5 ) that the operation of scalar 
multiplication is distributive with respect to addition. 

Hence F l .s + F 2 .s = (F 1 + F 2 ).s and the sum of the amounts of 
work done by two forces is equal to the work done by their resultant. 

Similarly, F.s x + F.s 2 = F.(s l 4 - s 2 ) and the sum of the amounts 
of work done by a force in two displacements is equal to the work done by 
the force in the resultant displacement. 

Furthermore, if F is a variable force, the work done in a small displace¬ 
ment Ss will be SW , where 


SW = F.Ss. 

Hence the total work can be written as 


W = lim ^ F.Ss = 


F.ds. 


/ 


/ 


F 

I 


Ss 


/ 


Alternatively, if the velocity and acceleration of the particle are v , a 
respectively, we can write 

SW ^ Ss „ 

— = F.— » F.v 
St St 


dW 

~d T 


F.v . 


But, by Newton’s second law, 

dv 
dt 

dv d 


F = ma — m 
dW 




W = [>z; 2 ] 



and again we see that the work done by a force is equal to the increase in 
kinetic energy. 
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Example 3 

A particle of mass 6 kg slides from rest down a rough plane which is inclined 
at 30° to the horizontal. If the resistance to motion is 12 N throughout, find 
the work done by the various forces when the mass has moved a distance 
8 m and hence find its velocity at this point. (Take g ~ 10 ms -2 .) 



The three forces on the particle are: 

(i) its weight 6 x 10 = 60 N; 

( ii ) the normal reaction R ; 

(Hi) the resistance 12 N; 

and the amounts of work done by these in the displacement 8 m are: 

(i) 60 x 8 x cos 60° = 240 J; 

(ii) zero (since force is perpendicular to displacement); 

(Hi) -12 x 8 = -96 J. 

Hence total work = 144 J. 

But final k.e. = \ x 6 x v 2 = 3v 2 . 

So 144 = 3v 2 

=> v 2 — 48 => v — 6.9 ms -1 . 


Exercise 11.8c 

1 Find the work done (in SI) when a force i + 3j + k acts on a particle 
and displaces it by an amount: 

(0 — i + 2 \j + 3k\ (ii) 3i — 2 \j — k\ (iii) i — k. 

2 A car of mass 800 kg is travelling at 30 ms ” 1 with its engine switched 
off when it comes to a small hill. Find: 

(i) its k.e. at the foot of the hill; 

(ii) the maximum height of hill which the car can surmount (ignoring all 
resistances). 

3 A boy slides down a smooth chute at a swimming pool. If its top is 6 m 
above the water and its bottom is 1 m above the water, find his speed: 

(i) on leaving the chute; 

(ii) on entering the water. 
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4 A box of mass 10 kg is attached to the end of a rope of length 5 m 
and hangs freely. If it is then given a horizontal impulse of 100 N s find: 

(i) its initial speed; 

(ii ) its initial kinetic energy; 

(Hi) the height to which it will rise. 

5 A marble of mass 20 g rests at the top of a smooth circular tube which 
has diameter 1 m and is in a vertical plane. If the marble is just displaced 
from rest, find its speed: 

(i) when travelling vertically at the end of the horizontal diameter; 

(ii) when passing through the lowest point of the tube; 

(Hi) when 25 cm above the lowest point. 

Hence find the reactions of the tube on the marble at these three instants. 

6 A boy of mass 40 kg swings on a rope of length 2 m which is just taut 
and attached to a point at the same height. When the rope is vertical, find: 

(i ) the kinetic energy of the boy; 

(ii) the speed of the boy; 

(Hi) the tension in the rope. 

7 A commando swings on a rope fixed at its upper end A. The rope, 
initially inclined at 60° to the downward vertical, remains taut and may 
be considered as inextensible and of negligible mass. The commando can 
be idealised as a particle at the end of the rope. If: 

(i) his mass is 70 kg; 

(ii) the rope is 10 m long; 

(Hi) he lets go when the rope is vertical; 

(iv) A is 12 m above the ground; find his landing speed. 

(Neglect air resistance, and take g = 10ms' 2 .) 

Which of (z), (ii), (Hi), (iv) can be varied, one at a time, without affecting 
this speed? (s.m.p.) 

8 A particle of mass m is slightly displaced from the top of a smooth 
hemispherical dome of radius a. If its speed is v when the radius to the 
particle makes an angle 0 with the vertical, find: 

(i) an equation connecting v with a , g and 0; 

(ii) the normal reaction N in terms of a , g and 0; 

(Hi) the value of 0 for which N = 0 (i.e., at which the particle leaves the 
surface). 


Power 

We know from everyday experience that when a certain task can be 
accomplished by either of two forces, it frequently happens that one of 
them can do the necessary work more quickly. Such a force is then said to 
have 4 greater power 5 . 

More precisely, we define the power P of a force as its rate of doing work : 
P = dW/dt. 
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But we have already seen that when a force F is moving its point of 
application with velocity v , 


dW 

dt 


= F.v. 


Hence 



F.v 


So the unit of power in SI is 1 joule per second and is called 1 watt (W): 


1W = IJs ' 1 


Example 4 

In British units the standard unit of power was 1 horse power (h.p.), being 
the rate of working required to lift a mass of 550 pounds through 1 foot in 
1 second. 

Taking 1 lb = 0.454 kg, 1 ft = 0.305 m, g = 9.81 ms -1 , find 1 h.p. in 
kilowatts. 

In 1 s, a force working at a rate of 1 h.p. will lift 550 lb a height of 1 foot, 
i.e., 550 x 0.454 kg a height of 0.305 m. 

So work done in 1 s 
= (550 x 0.454 x 9.81) x 0.305 J 
= 746 J. 

Hence 1 h.p. = 746 J s _1 = 746 W 
=> 1 h.p. = 0.746 kW. 

Example 5 

Find the power of a pump which raises water from a lake at a rate of 
5 m 3 s" 1 and delivers it 6 m higher at a speed of 8 m s _1 . 

In 1 s, the mass of water moved = 5 x 10 3 kg. 

So weight of water moved = 5 x 10 3 x 9.81 N and the work done in raising 
it through 6 m 

= 5 x 10 3 x 9.81 x 6 = 2.94 x 10 5 J. 

But this water is also given an amount of kinetic energy 
= |x (5 x 10 3 ) x 8 2 = 1.6 x 10 5 J. 

So total work done in 1 s = 4.54 x 10 5 J and the power of the pump 
= 4.54 x 10 5 W 
= 454 kW. 
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Exercise n.8d 

1 A pump delivers water at the rate of 900 kg min -1 from a reservoir 

to a nozzle 8 m above the surface of the reservoir. The water emerges from 
the nozzle with a speed of 12 m s -1 . Calculate the power, in kilowatts, of 
the pump. (c.) 

2 A pump raises 1 000 kg of water each minute from a depth of 20 m 

and delivers it at a speed of 8 ms" 1 . Assuming that the efficiency of the 
pump is 40%, calculate its power. (o.c.) 

3 A railway train has mass 1.4 x 10 6 kg and the maximum power of 

the engine is 1.8 x 10 6 W. Calculate the maximum acceleration, in 
ms -2 , when the train is travelling on the level at a speed of 12 ms -1 , the 
total resistance being 9.6 x 10 4 N. (c.) 

4 A car of power 50 kW has mass 800 kg, which is equally distributed 
over the four wheels. The coefficient of friction between the two driving 
wheels and the road is 0.3. If the road is level, and resistances to motion 
can be neglected, find the maximum acceleration of the car (in ms -2 ): 

(i) at 20 km h -1 ; 

( ii) at 10 km h -1 . (o.c.) 

5 An escalator takes passengers from a lower level to a higher level 

through a vertical height of 20 m. On average two passengers get on the 
escalator per second and two leave at the top per second. If the average 
mass of a passenger is taken to be 75 kg and the machinery has 60% effici¬ 
ency, calculate, in watts, the average power output of the motor which 
drives the escalator. (Take g to be 9.8 m s -2 .) (c.) 

6 A car of mass 1 000 kg is moving on a level road at a steady speed of 
100 km h -1 with its engine working at 60 kW. Calculate in newtons the 
resistance to motion, which may be assumed to be constant. 

The engine is now disconnected, the brakes applied, and the car comes 
to rest in 100 m. Find the additional retarding force arising from the 
application of the brakes. 

If the engine is still disconnected, find the distance the car would run up 
a hill of inclination sin -1 before coming to rest, starting at 100 km h 1 , 
when the same resistance and braking force are operating. (c.) 

7 Sand pours over the lip of a chute and falls vertically onto a conveyor 
belt which is moving horizontally at a steady speed of 0.5 ms T If the 
supply of sand is steady at the rate of 50 kg s - what horizontal momen¬ 
tum is given to the sand each second? Find the force that is required to 
give the sand this momentum. 

If 10 W is the power required to operate the conveyor belt without a 
load, find whether a motor of maximum power 25 W can deal with sand 
at the rate of supply of 50 kg s - 1 . 

If the chute were modified so that the sand was delivered with a hori- 
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zontal component of velocity of 0.5 ms -1 in the direction of motion of 
the belt, find the maximum speed at which the belt could move the load, 
assuming that the power required for the belt itself is unaltered and the 
sand is delivered at the same rate as before. (m.e.i.) 

8 A VTOL aircraft hovers by taking in air through a large intake and 
ejecting it at high velocity downwards. If the relative velocity of ejection 
is v m s' 1 and the mass of the aircraft is M kg, find expressions for: 

(i) the mass of air ejected per second for hovering; 

(ii) the power required to hover. 

Later, when the aircraft is flying as a normal aeroplane, the engines work 
at half this power and the air resistance varies as the square of the speed. 
The maximum steady speed attainable in level flights is Fms" 1 . Show that 
at a speed off Fms" 1 the engines would supply enough power for the air¬ 
craft to climb in normal flight at a rate of 37z//256 ms 1 . (m.e.i.) 

9 A car of mass m started along a horizontal road by an engine which 
works at a constant rate P. Ignoring all resistances to motion and letting 
the distance travelled, velocity and acceleration after time t be a, v, a 
respectively, find equations which connect: 

(i) a , v; (ii) v, t\ (Hi) v, x; ( iv ) *, t. 

10 An engine is fitted to a vehicle of mass m kg. It is known from tests 
that it will give constant tractive force F N up to a road-speed of U m s 1 , 
and at higher speeds will give constant power FU watt. Show that at speed 
z/ms -1 (v ^ U) the tractive force is FU/v N. Form the appropriate 
equations of motion for the vehicle if the resistance to motion at all speeds 
is given by kv 2 N, k being constant. 

Show that there is a maximum attainable speed V m s 1 and find an 
expression for V in terms of k, U and F. Given that m = 500, F = 2 500, 
U = 25 and V = 50, determine the value of k . Calculate the acceleration 
at each of the speeds 5, 15, 25, 35 ms" 1 and hence estimate the time taken 
to reach 40 m s' 1 from rest, giving your result to the nearest second. 
Explain your method. (m.e.i.) 


i 1.9 Oscillations 

In 5.10 we spent a little time studying simple oscillations like alternating 
currents, together with their amplitudes, periods and frequencies. We can 
now take this investigation further, and shall begin by considering three 
other ways in which such oscillations can arise. 

Example 1 

If a spring of stiffness k hangs vertically from a fixed point, with its other 
end attached to a mass m , investigate its equation of motion at a point when 
it is pulled down a further distance x. 
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Unloaded 


Loaded and 
in equilibrium 


Loaded and 
in motion 


(extension = a ) 


(extension = a + x) 



By Hooke’s law, when extension = 0 , the tension = ka. So, since the 
mass is then in equilibrium, ka — mg. 

Also, when extension = a + x, tension = k(a + x). 

So the general equation of motion is 


mg - 

k(a + x) = mx. 

But 

ka = mg. 

So 

— kx = mx 

=> 

X: 

II 

1 

| 


m 

Hence the acceleration of the mass is proportional to its extra displace¬ 
ment and, of course, in the opposite direction: the tension in the spring 
acts as a restoring force , attempting to re-establish equilibrium. 


Example 2 

Investigate the equation of motion of a cylindrical canister buoy, bobbing 
up and down at sea. 

We shall suppose that the buoy has constant cross-sectional area A and 
height /?, and that in equilibrium it is submerged to a depth d. If the density 
of the buoy is p and of the sea is < 7 , we can investigate its motion when it is 
submerged by a further amount x. 

In each case, both of equilibrium and of submersion by a further amount x, 
there are just two forces acting: 

(i) the weight of the buoy. 

Now its volume = Ah 
so its mass — Ahp 

and its weight = Ahpg ; 
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(«) the buoyancy , or upthrust of water on the buoy, which Archimedes 
showed to be equal to the weight of the water displaced. 

Now in the equilibrium position: 

volume of water displaced = Ad 
=> mass of water displaced = Ada 
=> weight of water displaced = Adog 
=> buoyancy = Adog; 

and after submersion by a further amount x, the buoyancy is A(d + x)og. 
At the equilibrium position, we see that 

Adog = Ahpg 
=> do = hp; 


and at the more general position, using Newton’s second law, 


Ahpg — A(d + x)og = Ahpx 
=> — Axog — Ahpx 


x — 


ph 


x. 


So, again, we see that the acceleration of the buoy is proportional to its 
displacement from the equilibrium position and in the opposite direction. 

As our final example, we now investigate an oscillation which, unlike 
the previous cases, is not along a straight line. 
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Example 3 (Simple pendulum) 

A mass m is suspended from a fixed point by means of a light inextensible 
string of length / and swings to and fro in a vertical plane. 



When the string makes an angle 9 with the vertical, the acceleration of 
the mass has two components: 

19 perpendicular to the string, 
and l6 2 along the string. 

Furthermore, there are just two forces acting: 

(i) the weight mg , 
and (ii) the tension T in the string. 

Hence, by Newton’s second law and resolving perpendicular to the string: 
— mg sin 9 — mlO 

=> 6 = —y sin 6 . 

In this case, therefore, 9 is proportional not to 9 , but to sin 9. However, 
if the string is swinging through a small angle, we know that sin 9 & 9. 

Hence & x -jd. 

Simple harmonic motion 

In all three examples the final equation of motion was of the form 
x — — kx (k > 0), 

which is evidently of considerable importance. Any motion which can be 
modelled by such an equation is called simple harmonic motion {or s.h.m .). 
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As k > 0, we can let k = co 2 (so that co is another constant) and obtain 
the equation: 

x = —co 2 x 

This proves to be a more convenient form of the equation and is therefore 
adopted as standard. 

We can now proceed to find its solutions by either of two methods: 


Method 1 

x = —<o 2 x 
=> 2xx — —2 co 2 xx. 

If we now integrate with respect to f, it follows that 
x 2 = — co 2 x 2 + A. 


Supposing that when x = 0 the value of x is a 
(i.e., that a is the amplitude of the oscillation), it follows that 
0 2 = —co 2 a 2 + A 
=> A = co 2 a 2 . 


So x 2 = co 2 (a 2 — x 2 ). 
dx 

Hence — = ±C0y/ (a 2 — x 2 ) 


+ 


d^: 


j{a 2 - x 2 ) 


— 00 dt 


± sin 1 - = cot + £ (where £ is a constant angle) 
a 


- = ±sin (cot + s) 
a 

x = +<3 sin (cot + e). 


Now —sin (cot + e) = sin (cot + £ + n), 
so x — a sin (cot + £) or a sin (cot + £ + n). 

But £ is itself an arbitrary constant, so the second of these general solutions 
is the same as the first. 

Hence x = a sin (cot + £) and any simple harmonic motion can be 
described by an ordinary sine oscillation. The constant £ is called the angle 
of phase , or simply the phase. 

Moreover, sin (cot + e) takes the same value when cot is increased by 
27T, i.e., when t is increased by 2n/co. 


So the period of the s.h.m. is 2ix/co (and its frequency is co/2n). 
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Summarising, we see that the s.h.m. has the properties: 


1! 

1 

S 

K) 


x 2 = co 2 (a 2 — x 2 ) 


x = a sin (cot + e) 


amplitude = a\ period = ; 

phase angle = £ 

co 



Method 2 

Alternatively, x = — co 2 x can be written 

d 2 x 2 
— 5 - + co x = 0 . 
dt 2 

Now x = A e mt is a solution, provided that 
m 2 + co 2 = 0 => m = ±ico. 

Hence the complementary function is 
x = A e i(0t + B e~ i<ot 

= A (cos cot + i sin cot) + B (cos cot — i sin cot) 

= C sin cot + D cos cot, 

where C, D (like A, B) are arbitrary constants. 

If we now choose a , s so that 

a cos £ = C 
a sin 8 = D , 

it follows that 

x = a (sin cot cos e + cos co£ sin e) 

=> x = a sin ( cot + e). 

Hence x = aco cos (cot + e) 

and it follows immediately that 

x 2 = co 2 (a 2 — x 2 ). 

In particular, we can now calculate the periods of the oscillations in the 
above examples: 

Spring of stiffness k supporting particle of mass m : 





Buoy of density p in sea of density a : 
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ft) = 



period = 2 n 



Simple pendulum of length /: 

(0 = Ji ^ p eriod = 2 *^' 


Example 4 

A particle is describing simple harmonic motion in a straight line. When 
its distance from O, the centre of its path, is 3 m, its velocity is 16 m s 1 
towards O and its acceleration is 48 ms " 2 towards O. Find: 

(i) the period of the motion; 

(ii ) the amplitude of the motion; 

(Hi) the time taken by the particle to reach O; 

(iv) the velocity of the particle as it passes through O. 

With the usual notation, x = —co 2 x. 

But when x = 3, x = — 48. 

So -48 = —3m 2 => cu = 4. 

2 n 71 

Hence period = — = — « 1.57 s. 

ft) 2 

(ii) Furthermore, x 2 — co 2 (a 2 — x 2 ) — 16 (a 2 — x 2 ). 

But when x = 3, x = — 16. 

So 256 = 16(<2 2 - 9) 

=> a 2 — 9 = 16 => a = 5. 

So amplitude = 5 m. 

(Hi) Moreover, measuring t from the instant when x = 0, we know that 
x = a sin cot = 5 sin 4 1. 

So letting x = 3, we see that 

3 = 5 sin At ^ \t = sin " 1 0.6 = 0.6434 (= 36° 52 r ) 

=> t = 0.16 s. 

So time taken = 0.16 s. 

(iv) Finally, x 2 = 16(25 — x 2 ) 
and putting x = 0 
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we obtain i 2 = 16 x 25 
=> x = 4 x 5 = 20. 

So velocity of particle as it passes through O is 20 m s 


Example 5 

A particle of mass m hangs at rest from the end of a vertical string which is 
attached to a fixed point and stretched by an amount e. If it is then given a 
further extension a and then released, find the period of its subsequent 
oscillations if (i) a < e\ (ii) a > e. 

Natural Equilibrium Starting General 

length position position position 



I 


I 


e + a 





kT = k(e + x) 

*■ 


mg 


Let the stiffness of the string be k. Then, by considering the position of 
equilibrium, 


mg = ke => k = - 

e 

At the general position, applying Newton’s second law: 

mg — k(e + x) = mx 
=> —kx = mx 

k S 

=> x = - X = - X. 

m e 

So the particle executes simple harmonic motion with co 2 = g/e, provided 
only that the string remains taut. It is therefore now necessary to dis¬ 
tinguish two cases: 

(i) a < e. 

In this case, the string remains taut, so the period of the oscillation is 

27T _ 27T je 

Oi yjg/e 71 V g 
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(ii) a > e. 

In this case the string becomes slack as when x = — e the particle is still 
travelling upwards. So the full cycle consists of three parts: 

(a) Particle below equilibrium position (x > 0): 

time taken = — = n — 

(D V g 

(b) Particle above equilibrium position but string still taut ( — e < x < 0): 

as the motion is simple harmonic, and if we measure time from the instant 
of passing through the equilibrium position, 



Putting x = —e, 



So time taken = 2 



(c) String slack: 

The string becomes slack when x = —e. 
But x 2 = co 2 (a 2 — x 2 ). 

So at this point x = 

Hence time taken with string slack 



g V g* 

So total time taken 




Exercise 11.9 

1 A spring has natural length 4 m and stiffness 20 N m 1 . One end is 
attached to a fixed point O of a smooth horizontal plane and the other end 
to a particle of mass 5 kg which is extended by a further distance of 3 m 
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and then released to oscillate in a straight line. After time t the extension of 
the spring is x. Taking g = 10 ms" 2 , find: 

(i) the equation of motion of the particle; 

(ii) the period of its oscillation; 

(Hi) x in terms of t\ 

(iv) the maximum speed of the particle. 

2 A vertical light spring has two equal masses fixed to it at its ends, as 
shown. When the lower mass lies on a table, the spring is compressed a 
distance d by the weight of the upper mass. If the upper mass is now pushed 
down a further distance 2.bd and then released, examine whether the 
lower mass will at some time leave the table. 

(You may assume that if the lower mass were fixed, the upper mass 
would oscillate symmetrically about its equilibrium position.) 



3 A boat is tossing (moving vertically up and down) in s.h.m. so that at 
one extreme position the weight of a passenger appears to be halved. 

(i) What is the ratio of apparent weight to actual weight at the other 
extreme? 

(ii .) If the total range of motion is 10 m, what is the period of a complete 
oscillation ? 

(Hi) What is the ratio of the volumes of water displaced by the boat at the 
two extremes? 

(iv) During what fraction of the time is the apparent weight within 10 per 
cent of the real weight? (m.e.i.) 


4 A particle of mass 2 kg is hung from the end of a vertical string and 
stretches it by 10 cm. Find the period of the oscillation if it is then pulled 
down a further distance of: 

(■ i) 5 cm; (ii) 15 cm; 

and then released. (Take g « 10 ms" 2 .) 

5 One end O of a light elastic string OA, of natural length 4 a and modulus 
3 mg, is attached to a fixed point of a smooth horizontal table. A particle of 
mass m is attached to the other end A. The particle is pulled along the table 
until it is at a distance 5 a from O and is then released with the string straight. 
Show that the particle first reaches O after a time 



(71 + 8 ) 


(o.c.) 
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6 A man stands on a horizontal rough plank which performs a horizontal 
simple harmonic motion of period T and amplitude a. The coefficient of 
friction between the man and the plank is jU. Show that he will not slip on 
the plank if 

ligT 2 ^ 4n 2 a. (o-C.) 

7 Two particles of equal mass on a smooth horizontal table are connected 
by an elastic string of natural length a and modulus of elasticity equal to 
the weight of one particle. If the particles are held at rest at a distance 3 a 
apart and are then released simultaneously, find the time which elapses 
before they collide. 

8 A particle P, of mass m, is fastened to one end of a light string, of natural 
length a and modulus of elasticity A, whose other end is fixed at O. 
A particle Q, also of mass m, is fastened to one end of a light inextensible 
string of length a whose other end is fastened to P. Initially the system is at 
rest with OPQ in a vertical straight line and OP of length a , so that P 
is at the middle point of OQ. The system is now released. Prove, by con¬ 
siderations of energy or otherwise, that the length of OP is never less than a. 

The length of OP at time tis a + x. Prove that, 

9 X . d 2 * 2 

if n 2 = -—» then —j + n x = g. 

2 ma d t 

Show that 

X = -4 (1 - cos nt) and deduce the tension at time t in the string PQ. 

n (C.) 

9 A particle of mass m hangs at rest from a light inextensible string of 

length /. The particle is suddenly struck a small blow of impulse mv, per¬ 
pendicular to the string. Find the amplitude of the subsequent motion and 
the time taken for the particle to rise to a height equal to one-half of its 
greatest height, measured from the initial position. (m.e.i.) 

10 A particle of mass m is attached to the mid-point of a stretched light 
string whose tension is T and whose ends are fixed to two points distance l 
apart on a smooth horizontal plane. Show that if the particle is moved a 
distance x perpendicular to the string, the restoring force is 2 T sin 0, 
where 0 is the angle which the parts of the string make with their original 
line. 

Hence find the equation of motion when 0 is small, and the period of 
oscillation. 

*ii.io General statics: moments, couples, and equilibrium 

So far we have limited our study to that of a single particle. But we are often 
concerned with the behaviour of collections of particles, both at rest and 
in motion. These might range from the particles of a highly compressible 
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gas injected into a turbine to those of water, nearly incompressible, 
in waves at sea. But the simplest such collection, with which we shall be 
principally concerned, is one in which the distances between every pair of 
particles remains constant, and this we know as a rigid body . First, however, 
we must develop the idea of moment of a force. 

Moments of forces 

When we considered a particular force, of known magnitude and direction, 
acting on a single particle, there was never any doubt about its line of action: 
it necessarily had to pass through the particle. But if we now consider a 
collection of particles, say a chair, it is a common experience that a par¬ 
ticular force can have a very different effect if its line of action is shifted: 
the reaction of a man on a chair might in one position produce stability, 
whilst if exactly the same force acts at a different position the chair might 
topple. In the case of a force acting on a single particle it was unnecessary 
to consider the turning effect of the force, because it had none. But when we 
consider a collection of particles, we must clearly study the turning effects 
of the individual forces involved. 

To take a simple example, let us suppose that a beam of length 2 m is 
freely hinged at one of its ends O so that the beam is in a horizontal position, 
and let us further suppose that a force of magnitude 1 000 N acts vertically 
upwards at its other end. 

1 000 N 


Now it is clear that the force of 1 000 N exerts a turning effect about O, 
and we know that this effect can be altered 
{a) by increasing, or decreasing, the magnitude of the force; 

(b) by altering its direction; 

(c) by altering the position of its line of action, perhaps by moving it 
nearer to O. 

The reader will already be familiar with a measure of such a turning 
effect called the moment about 0 of the given force, which is influenced by such 
changes. This is usually defined as the product of the magnitude of the 
force and its perpendicular distance from O: 


moment about O of F — pF. 
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Let us now consider what will happen if we radically alter the direction 
of the force. For instance suppose that instead of acting vertically, the force 
remains perpendicular to the rod but acts in a horizontal direction: 

1 000 N 



It is immediately apparent that such a change alters the turning effect 
from being in a vertical plane to being in a horizontal plane, as shown by 
the arrows in the two figures. 

We therefore see that a turning effect is defined only when we know not 
only its magnitude and point of application but also the plane in which it is 
acting. A bowler, for instance, may give equal turning effects to three 
successive balls, one with top spin, the next as an off-break and the last as a 
leg-break: their distinguishing features are the planes in which they are 
made to spin. Now each of these planes is perfectly defined by the direction 
of its normal so we see that for the turning effect to be fully defined we need 
to know both a magnitude and also a direction , that of the normal to the 
plane in which it acts. It is, therefore, no surprise that the moment of a force 
should emerge not as a scalar quantity, but as a vector. 


Moment as a vector product 

A force and its line of action are completely defined by: 

(i) a vector F representing the force; and 

(ii) the position vector r of any point on its line of action. 
Let us now consider the vector product r X F. 



First, we notice that the vector r X F does not depend on the particular 
point which was chosen on its line of action. For suppose that, instead of the 
point P (with position vector r), we had chosen another point P' (with 
position vector r '): 
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Now PP' is in the same direction as F. 


So PP' = aF, where X is a scalar 

=> r ’ — r — XF 

=> (r' - r) X F = /IF X F = 0 

=> r f x F — r X F. 


Hence the vector product r X F is independent of the particular point 
chosen on F and depends only on F and its line of action. 

Furthermore, if the angle between r and F is 0, we see that the magnitude 
ofr X Fis 

rF sin 0 = (r sin 6)F = pF. 

o 


Lastly, the direction of r X F is perpendicular to r and to F and so is 
perpendicular to the plane containing the line of action and O. 

In summary, it is evident that r X F depends only upon the vector F 
and its line of action, that its magnitude is the product of F and its per¬ 
pendicular distance from O, and that its direction corresponds to a turning 
effect in the plane containing F and O. We therefore now re-define the 
moment about O of the force F as the vector r X F: 


moment about O of the force F = r X F 


In particular, we notice that if F goes through O, its moment about O is 



0 X F = 0. 

Also r x ( — F) = — (r X F), 

so that the moments about O of two equal and opposite forces are them¬ 
selves equal and opposite. 
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More generally, an extremely valuable result follows directly from the 
fact that vector products are distributive with respect to addition. For 
suppose that P, Q are two forces which meet in a point and that their 
resultant is R. Let the position vector of their point of intersection be r. 



r\ 

\ 

\ 

\ 

o 


Then rXP + rXQ,= rX{P+Q,)=rXR. 


So 


the sum of the moments about any point of two concurrent forces 
is equal to the moment of their resultant. 


Equivalent systems of forces 

It sometimes happens that two apparently different systems offerees have: 

(i ) their vector sums equal; 

(«) their moments equal about every chosen point. 

In such a case we say that the two systems are equivalent . In particular, 
the resultant of two concurrent forces has the same moment about any point 
as the sum of the moments of its constituent forces. So the resultant of two 
concurrent forces is equivalent to the forces themselves. 

It can now easily be shown that two sets of forces are equivalent if: 

(z) their vector sums are equal, and 

(ii) their moments are equal about any one particular point. 

For let us take this point as origin and let the two sets be 

F t at r t (i = 1 , 2.») 

and F[ at r[ (i = 1, 2, . . ., n). 

Then Z F i = Z F i and I r, X F t = £ r[ X F-. 







242 FURTHER VECTORS AND MECHANICS 

So the moment of the first set of forces about another point c 
= I {r, ~ e) X F t 
= £ r i x F t - c X Z F i 
= Zr;xF;-cxZFl 
= I (r; - c) X f; 

= moment of the second set of forces about c. 

Parallel forces 

The question now arises whether we can find such an equivalent force, or 
resultant, when the two forces are not concurrent but parallel. 

Let us consider two parallel forces k X F and k 2 F acting through points 
r x and r 2 . 



Now if a single force is equivalent to the original pair, it must be 
k^F + k 2 F = (k 1 + k 2 )F. 



We now take a point r on the line of action of this force. Since it is 
equivalent to the original forces, we know that 

r x (*! + k 2 )F = r i x k x F + r 2 x k 2 F 
o {k 1 + k 2 )r x F = {k l r l + k 2 r 2 ) x F. 

In particular, we see that this equation is true if 
fi + k 2 )r = k x r x + k 2 r 2 

k 1 f 1 + k 2 r 2 
o r = -> 

k\ T k 2 

which is the point dividing the line joining r x and r 2 in the ratio k 2 \k x . 
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As O could have been taken at any point whatsoever, it is clear that the 
original forces k x F and k 2 F are equivalent to a force (k x + k 2 )F acting 
along a line which separates their lines of action in the ratio k 2 : k x . So this 
force is again known as their resultant. 


Centre of gravity 

We now suppose that a system of particles consists of masses m l , m 2 , m 3 ,. . . 
at points r 1 , r 2 , r 3 ,. . . . 

Then these masses have weights m x g, m 2 g, m 3 g. . . 
acting at points r x , r 2 , r 3 . . . . 


.0 


r 2 


r x 


t 


t 


r 3 


V 






Hence, by a simple extension of the above result, we see that they are 
equivalent to a weight of (m 1 + m 2 + m 3 + * * • )g acting at the point 

m l r 1 + m 2 r 2 + m 3 r 3 + • • • 

-, 

mj + m 2 4- m 3 + • • * 

which we have already met as the centre of mass. 

Hence the weights of the separate particles are equivalent to their total 
weight acting through the centre of mass, which is therefore also called the 
centre of gravity of the system. 


Couples 

There is, however, one exceptional case of two parallel forces, when 
k x = —k 2 and the forces are equal and opposite with different lines of 
action. Such a system is called a couple. 

We now consider a couple consisting of two forces, F at r x and — F at r 2 . 
If a point P is taken with position vector/?, the moment of the couple 
about P is 

(n - p) X F + (r 2 - p) X -F 

= r l *F-pxF-r 2 xF+pxF 

= (»*i - r 2 ) X F 
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F 



• p 

As this is independent of the position of P, it is simply called the moment of 
the couple. Note that the moment of a single force about a point on its line 
of action is zero, so a couple cannot possibly be equivalent to a single force 
and is therefore said to be irreducible. 

Example i 

Coplanar forces F x , F 2 , F 3 act through points whose position vectors are 
r i 5 r 2 > r 3 respectively and 

F x = 4 i + 3/, r x = 3 i + 2/, 

F 2 = hi - 8 /, r 2 = -i + j, 

F 3 = -2* + 6 /, r 3 = 2i - j. 

(i) Show that these forces reduce to a single force and give its magnitude 

and direction. 

(ii) Find a vector equation or the Cartesian equation of the line of action 
of the resultant. 

(Hi) Find the single force through the point r = i + j and the couple 
which are together equivalent to this system. (w.) 

(i) If these forces reduce to a single force, it must be 
F x + F 2 + F 3 = li + j\ 

and so have magnitude ^50 and be in the direction making tan - 1 y « 8° 8' 
with Q*:. 

(ii) If the line of action of its resultant passes through r = xi -F yj , 

then r X (F x + F 2 + F 3 ) = r x X F x + r 2 X F 2 + r 3 X F 3 
=> (xi + yj) X (li +/) = (3t + 2j) X (4* + 3» 

+ (-* + j) X (5t - 8 j) 

+ (2i -j) X (-2* + 6 j) 

=> (x - 7y)k = (1 + 3 + 10)A; = 14* 

=> x — ly = 14. 
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( Hi ) Let the required force be F and the required couple be G. 

Again, F = F x + F 2 4- F 3 — li 4- j. 

Furthermore, 

r X (F l + F 2 + F 3 ) + G = r x X F x 4- r 2 X F 2 + r 3 X F 3 
=> (* + j) X (7 i +j) + G= 14 A; 

=> -6k + G = 14k 

=> G = 20 k. 

So required force is li + j and required couple is 20 A;. 


Exercise n.ioa 

i Find the moments: 

About the point , Of the force , Acting at 


(0 fl 

(«) o 

{Hi) i +j 
(iv) i + 2 j — 3A: 

(») * “ J 

{vi) 2i + j — k 


i +j - k 
i +j ~ 2k 
i + 2j - k 
3 i + 2j — 3k 
— 2 i + 3j + k 
-i + 4 j - k 


i — k 
i + 2 j + k 
i +j ~ k 
3j + A: 
j + 2 A; 

— * + j + 2A: 


2 Determine in each of the following cases whether the two sets of forces 
are equivalent: 

(0 i acting at j and j acting at f, i + j acting at 2 ;; 

(H) —i acting at j and j acting at i, — i + j acting at 2 /; 

{Hi) 2 j acting at O and j acting at 3 f, 3 j acting at i; 

(iv) 4* acting at O and — i acting at —3;, 3f acting at —4/; 

(v) the couple formed by — i at j and i at 2 j, and that formed by —j at 2 i 
and j at i; 

{vi) the couple formed by — i at j and i at —j, and that formed by —j at t* 
and j at k. 


3 Show that each of the following systems of coplanar forces is equivalent 
either to a single force or a single couple (which may be zero). In each case 
find the equivalent force (including its line of action) or the equivalent 
couple: 

(i) i + j at i - j, 3 i - j at 2 i, 2 t + 3 / at 2j; 

(H) i + j at * - j , —3* + j at t + j, 2i - 2j at — f + 2/; 

(m) 2 j at i - j, -1 at j, i at 3 ;, - 2 j at O; 

{iv) 2 i 4- 3/ at i, —3 i + 4; at j, 4* + 5j at i + j, —i 4- 3j at — i — 2j. 

4 ABCD is a square of side 0 . The anti-clockwise moment of a set of forces 
in the plane of the square is 10 aP about A, - 20 aP about B and 10 aP about 
C. Taking the x-axis along AB and the y-axis along AD, determine the 
resultant of the set of forces and the equation of its line of action. 
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Determine the couple and the single force at the centre of the square 
which would be equivalent to this resultant. 

5 (z) Show that three forces represented in magnitude and direction by 

kAB, kBC, kCA and acting along the sides AB, BC, CA respectively of a 
triangle are equivalent to a couple of magnitude 2k times the area of the 
triangle. 

(ii) Use the result (z) to show that four forces represented in magnitude 
and direction by kAB, kBC, kCD, kDA and acting along the sides AB, BC, 
CD, DA respectively of a plane quadrilateral are equivalent to a couple of 
magnitude 2k times the area of the quadrilateral. (m.e.i.) 


Statics of a system: equilibrium 

Any system of particles (which may, or may not, comprise a rigid body) is 
said to be in equilibrium if, under the forces acting, it can remain at rest. 

Let us consider a system in equilibrium which consists of just three 
particles of masses m x , m 2 , m 3 whose position vectors are r 1 , r 2 , r 3 . 

The forces acting on these particles can be of two kinds: 

(z) external forces, which we call F t , F 2 , F 3 , acting on the particles 
m 1 , m 2 , m 3 respectively; and 

(zz) internal forces , between the pairs of particles. We let the force exerted 
upon m x by m 2 be F 12 , and the force exerted upon m 2 by m x be F 2l , etc. 
So the complete set of forces can be represented: 



Now, by Newton’s third law, every pair of internal forces must be equal 
and opposite. 


So F 12 4- F 2 ± — 0. 
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Moreover, since these forces have the same line of action, their moments 
about O must be equal and opposite, and 

r i X F 12 + r 2 X F 2l = 0 . 

If we now look at the equilibrium of the particles separately, we see that 

F x + F ls + F 12 = 0 

F 2 + F 23 + F 2l = 0 

F3 + F 32 + ^31 = 0. 

We now add these equations and remember that 
F 2 3 + F 32 = 0, etc., 
so that 


F\ + F 2 + F 3 — 0. 

Hence if a system is in equilibrium the sum of the external forces must 
be zero, just as when they act upon a single particle. 

Moreover, we can also take the vector products of each of the above 
expressions with r l9 r 2 , r 3 respectively, and obtain 

ri X F 1 + r t X F 13 + r l X F 1Z = 0 

r 2 X F 2 + r 2 X F 23 + r 2 X F 2l =0 

r 3 X F 3 + r 3 X F 32 + r 3 X F S1 = 0. 

We now add these three equations and remember that 

r i X F 12 + r 2 X F 2 i=0 
(and r x X F 13 + r 3 x F 3l = 0, 
r 2 X F 23 + r 3 X F 32 = 0). 

Hence r x X F x + r 2 X F 2 + r 3 X ^3 = 0. 

So the sum of the moments about O of the external forces must be zero. 

Now from the start of this section, O has been any fixed point which it is 
wished to use as an origin. It therefore follows that the sum of the moments 
of the external forces about any point must be zero. 
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For simplicity, we limited ourselves to the consideration of just three 
particles. But the whole of the above analysis can immediately be extended 
to cover any number, finite or infinite. Summarising, we can see that: 


For any system of particles to be in equilibrium: 

(i) the sum of the external forces must be zero; and 

(ii) the sum of the moments of the external forces about any fixed 
point must also be zero. 


Example 2 

A uniform beam of length 10 m and weight 2 000 N has a man of weight 
600 N standing on one end. If the beam is supported at two points 2 m 
from each end, find the reactions at these supports. 




D 


Y 

600 


Y 

2 000 


Letting the two reactions (in newtons) be R , S and taking moments 
about C, 

6R = 8 x 600 + 3 x 2 000 = 10 800 
=> R = 1 800. 

Also, taking moments about B, 

2 x 600 + 6S = 3 x 2 000 
=> 6S = 4 800 

=> S = 800. 

So the two reactions are 1 800 N and 800 N (and we note, as we would 
expect, that R + S = 600 + 2 000). 


Example 3 

A lightweight ladder of length 8 m is inclined at 70° to the horizontal, 
resting upon a rough horizontal path and a smooth vertical wall. If a man 
climbs this ladder and it begins to slip when he is three-quarters of the way 
up, find the coefficient of friction at its lower end. 
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If we neglect the weight of the ladder itself, there are just three external 
forces acting on the system of the man and ladder: 

( i ) W, the weight of the man; 

(ii) S , the normal reaction at the smooth wall; 

(Hi) R , the reaction at the ground which (on account of a frictional com¬ 
ponent) can act at an inclination to the normal. 

Now when the system is in equilibrium these three forces must have 
zero movement about every point. But S and W clearly have zero moment 
about their point of intersection, so R must also pass through this point. 

Hence as the man gradually climbs the ladder, the reaction R makes an 
increasing angle with the vertical; until, finally, when the man is three- 
quarters of the way up, this angle becomes A, the angle of friction, and the 
ladder starts to slip. 

If we now look more closely at this limiting position, we see that (in the 
notation of the figure) 


h 

and x 
- x 

=> tan A = - 
h 


8 sin 70° 
6 cos 70° 

6 cos 70° 
8 sin 70° 

J cot 70° 
f x 0.36 
0.27. 



So pi = tan A = 0.27. 
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Alternatively, we can let the components of R be F and N , and then 
proceed as follows: 


5 


Resolving vertically, N = W. 

Taking moments about the point P, 
F x 8 sin 70° = W x 6 cos 70° 

3 w 

=> F = —- cot 70° 

4 

=> ^ = I cot 70° = 0.27. 



Hence, if slipping takes place at this point, fi = 0.27. 


Exercise 11.10b 

1 A beam AB, of length 22 m and mass 150 kg has its centre of gravity 
at G, 11.5 m from A. It is supported in a horizontal position by two trestles 
at D and E, where AD = 7.5 m and AE — 15 m. Find the pressure on each 
trestle. (Takings « 10ms' 2 .) 

Find the mass of the heaviest man who can sit at either end of the beam 
without tilting it. (o.c.) 



A uniform rod of length 1 m and mass 50 g is freely hinged at A and has 
its other end resting on a smooth peg B. The rod in this position makes an 
angle of 30° with the horizontal. 

Calculate: 

(i) the reaction at B; 

(«) the horizontal component of the reaction at A. 

(Take g ss 9.81 m s -2 .) ( c -) 

3 A uniform beam ABCD, where AB = a, BC = 3 a and CD — a , is of 
weight 4 W and rests horizontally in equilibrium on supports at B and C. 
Loads of weight xW,yW are hung from A, D respectively. Calculate, in 
terms of x, y and W, the load borne by each support. If the sum of the loads 
borne by both supports cannot exceed 20 W, indicate by shading on the 
plane of the coordinate axes O xy the region within which the point ( x , y ) 
must lie. Deduce that the greatest value of y is 14. (o.c.) 
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4 A rod AB, hinged at A, is 4 m long and of mass 5 kg, with its centre of 
mass 3 m from A. A rope is attached to B, passes over a smooth pulley 3 m 
above A and supports a mass M hanging freely. A mass m hangs from B 
and keeps AB in a horizontal position. 

(i) When m = 3, find M ; and, 

(ii) when M = 20, find m. 

5 A non-uniform rod AB, of length a and weight W , rests making an 

angle /J with the horizontal with its lower end B on a rough horizontal 
table. The centre of mass of the rod is at G where AG = \a. Equilibrium is 
maintained by a horizontal string attached to A. Show that the coefficient 
of friction between the rod and the table cannot be less than § cot /?. Find 
the tension in the string. (o.c.) 

6 A uniform beam AB rests in limiting equilibrium with A against a 
rough vertical wall and with B on rough horizontal ground. The vertical 
plane through the beam is perpendicular to the wall. The coefficient of 
friction at A is and the coefficient of friction at B is \. Calculate, correct to 
the nearest degree, the inclination of the beam to the horizontal. (o.c.) 

7 A uniform ladder is leaning against a vertical wall on horizontal ground. 
The coefficients of friction at the wall and at the ground are equal, and the 
ladder is on the point of slipping. Show that the total reaction at the wall 
must be at right angles to the total reaction at the ground. 

Draw a figure with the ladder inclined at 60° to the ground; by means of 
a circle with the ladder as diameter, mark on the figure the point where the 
two reactions intersect. (o.c.) 

8 A circular cylinder of weight W is held with its axis horizontal on a 
rough plane inclined at 30° to the horizontal by a cord wrapped round it, 
one end of the cord being fixed to the cylinder, the other leading away 
tangentially, at right angles to the axis of the cylinder. Determine the 
tension of the cord and the least possible coefficient of friction between the 
cylinder and the plane 

(i) if the cord leads away horizontally; 

(ii ) if it leads away vertically upwards; 

(Hi) if it leads away in such a direction as to make the tension a minimum. 

9 A uniform hemispherical shell rests with its curved surface against a 
rough vertical wall and a rough horizontal floor. If the equilibrium is 
limiting at both points of contact when the plane of the base of the shell 
makes an angle 9 with the horizontal, prove that 


sin 8 


w + 

1 + ll 2 


where fi is the coefficient of friction at both points of contact. 

Prove that such a position is impossible if fi > yj 2 — 1. 

(The centre of mass of the shell bisects the radius of symmetry.) (o.c.) 
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10 The mass per unit length of a rod AB of length / varies linearly from 
m at the end A to 2 m at the end B. Prove by means of the integral calculus 
that the total mass of the rod is 3 ml/ 2 and find the position of the centre of 
gravity. 

An inextensible string of length a is attached to a point P of the rod at a 
distance a (< l) from the end A. The other end of the string is fixed to the 
point C on a rough vertical wall and the end A of the rod rests against this 
wall so that A is below C and AC = a. Find the tension in the string and 
the frictional force at the point A, and show that this vanishes if Z = 18a/5. 

(m.e.i.) 

11 A uniform rod AB is suspended from a fixed point O of a rough vertical 
wall by a light inextensible string of length l which is fastened to B. The end 
A of the rod rests against the wall vertically below O with A below the 
level of B. If the rod is in equilibrium (not necessarily limiting) with 
OA = d and angle AOB — 0, find the tangent of the angle of inclination 
to the horizontal of the resultant reaction on the rod at A. 

If the angle of friction at A is A and equilibrium is limiting, with A about 
to slide downwards, show that 

cos (0 — A) = ^ cos A. 

12 O, A, B, C are the four corners of a square lamina OABC of side a . 
Forces act in the plane of the lamina as follows: 6 P along AO, P along OC, 
2 P along BC and IP along BA. 

( i ) Find the magnitude of the resultant of this system of forces, and the 
equation of its line of action referred to OA, OC as x- and y-axes. 

(ii) It is desired to maintain equilibrium by applying a force and a couple 
at one corner of the square. Find which corner should be selected so that 
the moment of the couple is as small as possible, and state the moment of 
this least couple. 

(iii) Find the couple which must be applied to the lamina so that the 

resultant of the four original forces and the couple acts in a line passing 
through the centre of the square. (m.e.i.) 


*11.11 General dynamics and rotation about a fixed axis 

Dynamics of a system of particles 


Suppose that a system of masses is in motion and that a typical mass at 


point r t is under the action of an external force F t and an internal force 
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Adding these equations over all the particles of the system, 

Y J F i + Y J i i = lrn i r i . ( 1 ) 

But, applying Newton’s third law, 

14 = 0 

and if the total mass of the system is M (= £ m t ) and the centre of mass is 
at r, then 

Mr = Yj m i r i 
=> Mr = £ m i r i . 

Hence equation (1) becomes 

Z l F, = Mr, 

so that the centre of mass moves as though all the mass and all the external 
forces were concentrated upon it. 

When, for example, a diving champion makes a spectacular dive from a 
high board, his movement may be a complicated mixture of twists and 
turns and somersaults, and his body is certainly far from rigid. Neverthe¬ 
less, the only external forces are the weights of the various particles that 
make up his body, and we see from the above result that his centre of mass 
moves as though all these weights and all his mass were concentrated there: 
in other words, his centre of mass (which might itself be varying relative to 
his body as he curls up, ‘jack-knifes ’, or stretches out) is bound to move in a 
perfect parabola. 

Exercise ii.ua 

1 A girl standing on perfectly smooth ice loses her balance. What is the 
path of her centre of mass? If a friend helped to save her as she fell, what 
would be the path of their joint centre of mass? 

2 Find the position vector of the centre of mass of particles of masses 
4, 3, 2, 3 units at rest at the points 

i + j, 2* - j, 2* + j\ 2 i + 3 j 

respectively. If each mass is acted upon by a force directed towards the 
origin and proportional to its distance from the origin, find the direction of 
the initial acceleration of the centre of mass. (l.) 


Rotation about a fixed axis: moments of inertia 

It is beyond our present scope to investigate further the general motion of a 
system of particles, and we must restrict ourselves to the special case of 
a rigid body which is rotating about a fixed axis: 
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Suppose that a rigid body is rotating with angular velocity co about a fixed 
axis and that a typical particle of mass m is at a perpendicular distance r 
from this axis. 

Then the speed of m is rco and its kinetic energy is ^mr 2 co 2 . 

So the total kinetic energy of the body is £ ^ mr 2 a> 2 = mr 2 ) co 2 . 

Now the expression £ wr 2 clearly depends only on the masses m and the 
way in which they are distributed in the body relative to the given axis. 
It is usually denoted by /, called the moment of inertia of the body about the 
given axis. 


So. 


k.e. = \lco 2 , where / = £ mr 2 


Comparing the two expressions 
k.e. = \mv 2 and k.e. = \l(a 2 > 

it might be expected that / will, for rotational motion, play a part very 
similar to that taken by mass for linear motion; and just as mass can be 
regarded as a measure of a particle’s inertia, or reluctance to accelerate 
when under the action of a given force, so the moment of inertia of a body 
about a given axis is a measure of its reluctance to accelerate rotationally 
when under the action of a given moment. 

Our first task, however, is to calculate the moments of inertia of a number 
of different rigid bodies. 


Circular hoop rotating about a perpendicular axis through its centre 

Suppose that a circular hoop of mass M and radius a is rotating about a 
perpendicular axis through its centre. 
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If a typical particle has mass m, 

I = Yj 1711/2 • 

But, for every particle, r = a; 

so / — X ma2 = (Z m ) a2 
=> I = Ma 2 . 

Uniform circular disc , about a perpendicular axis through its centre 

Suppose again that the body has mass M and radius a, and also that its 
surface density is o. 



We begin by dividing the disc into a series of concentric hoops, and 
suppose that a typical hoop has radius r and thickness dr. 

Then area of hoop = 27ir 8r 
and its mass = 2nar dr. 

Hence m.i. of hoop = (2nar dr)r 2 
— 2nar 3 dr 

and m.i. of disc = lim Z 27roT 3 dr 


= J 2nor 3 dr 
— h 10 a 4 . 


But 


M — o x na 2 = no a 2 . 


So m.i. — \Ma 2 . 


Uniform rod 

Suppose that a uniform rod has mass M and length 2a. Its moment of 
inertia clearly depends on the chosen axis, and we shall consider two cases, 
both of them perpendicular to the rod and through its end and mid-point 
respectively. 
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[a) About perpendicular axis through an end 



Let linear density of rod be p. 

Then M = 2 pa. 

Taking short elements of length Sx and mass p <5#, we see that 
m.i. = lim £ p Sx x x 2 



But M = 2 pa. 

c . 4 Ma 2 

00 m.i. =- 

3 


(b) About perpendicular axis through its mid-point 


1 

1 



-a | 

l r 

Sx 

+ a 


o 


In a similar way, 

m.i. = lim £ p dx x x 2 



So m.i. = \Ma 2 . 

Radius of gyration 
Since I = £ mr 2 , 

we can also express it as Mk 2 , where K is a length, called the radius of 
gyration of the body about this axis. 

I = Mk 2 


Hence 
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and we see that 


Zmr 2 

Z m 


is precisely the variance of the distances r of masses m from the given axis, 
so that K is their standard deviation (or ‘root mean square’). 

Thus, for a uniform circular disc about a perpendicular axis through its 
centre, 



and for a uniform rod of length 2 a about a perpendicular axis through its 
centre, 

Mk 2 ~ \Ma 2 
a 

=> k = ~r « 0.58«. 

V3 


Two general theorems 

In the calculation of moments of inertia it is sometimes convenient to make 
use of a general theorem, and here we shall establish two: one for finding 
the moment of inertia about an axis in terms of the moment of inertia 
about a parallel axis through the centre of mass, and the other which is 
applicable only to a lamina. 




258 FURTHER VECTORS AND MECHANICS 

Parallel axis theorem 

Suppose that the moment of inertia of a body about a certain axis is 7 0 
and about a parallel axis through the centre of mass is I G . 

We shall suppose that a typical mass m of the body lies at a point P and 
that a section is taken through P perpendicular to the axes (which in our 
figure are supposed to be perpendicular to the page) meets them in points O 
and G. 

Taking an origin at O, we let 
OG = c, GP = r , OP = R . 

Then 7 0 = X mR 2 = mR 2 

= i>( r + <o 2 

= Y mr2 + 2 Y mr.c'+ Y mc2 
= ]T mr 2 + 2 (Y mr ) • c + (Z m ) c2 - 
We now let the total mass of the body, £ m = M. 

Also, since the centre of mass lies on the axis through G, 
mr = 0. 

So 7 0 = Yj mr2 + 2c.0 + (£ m)c 2 



^o — ^3 + Me 2 


So, for example, we could have found the moment of inertia of a uniform 
rod about its end from that about its centre, simply by writing 


Io = 4 + Ma 2 
= \Ma 2 + Ma 2 
= %Ma 2 . 

And the moment of inertia of a uniform hoop of 
mass M and radius a about a perpendicular 
axis through a point of its circumference 

— 1q ~\~ Ma 2 

= Ma 2 + Ma 2 = 2 Ma 2 . 



‘Perpendicular axes' theorem for a lamina 

Suppose that perpendicular axes O#, Oy , Oz are taken with Q#, O y in the 
plane of a lamina and Oz perpendicular to its plane. Denoting moments of 
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inertia about these axes by I x , / y , / z , 



Hence, for example, we can immediately find the moment of inertia / of a 
uniform disc about a diameter simply by observing that, since I x = / y , 


I + / = |Mz 2 

=> / = 



Similarly, we know that a uniform rectangular lamina of mass M and 
sides 2tf, 2b can be dissected into strips parallel to its sides, so that its 
moments of inertia about parallel axes through its centre are 

\Ma 2 and \Mb 2 . 
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Hence, using the perpendicular axes theorem, we see that its moment of 
inertia about a perpendicular axis through its centre is 

\Ma 2 + \Mb 2 
= \M(a 2 + b 2 ). 

2b 


Exercise u.nb 

Calculate the moments of inertia of: 

1 A uniform rectangular lamina of mass M and sides 20, 2b about a per¬ 
pendicular axis through a corner. 

2 A uniform circular hoop of mass M and radius a about: 

(i ) a diameter; 

(ii) a tangent; 

(m) a perpendicular axis through a point on its circumference. 

3 A uniform circular lamina of mass M and radius a about: 

(i) a tangent; 

(ii) a perpendicular axis through a point on its circumference. 

4 A uniform sphere of mass M and radius a about: 

(i) a diameter; 

(ii) a tangent. 

5 A uniform spherical shell of mass M and radius a about : 

(i) a diameter; 

(ii ) a parallel line through a point of the shell. 

(Hint: + z 2 ) = £ m(z 2 + x 2 ) — £ m(x 2 + y 2 ) 

and ^ 2 + y 2 + z 2 = <2 2 .) 

6 A uniform solid cone of mass M, height A and base-radius a , about: 

(i) its axis of symmetry; 

(ii) a perpendicular axis through its vertex; 

(Hi) a diameter of its base. 

Dynamics of rotating bodies 

Suppose that a body is rotating with angular velocity co about a fixed axis, 
that the moment of the external forces about this axis has magnitude L and 
the moment of inertia of the body about this axis is I. 

Further, suppose that pn a mass m at a distance r from the axis there are 
acting transversely: 

(i) an external force P; and 

(ii) an internal force Q . 
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Then P + Q — mnb 

=> r{P + Q) = mr 2 (b 

=> £ rP + Yj r Q = (£ mr2 )ti> = !<*>■ 

But since the forces Q are internal, it follows from Newton’s third law that 
I rQ = 0. 

Hence ^ rP = Id). 

But X rP = L 

So L = I(b 

Now for a small rotation 36 the work done by P is Pr 36. So in a finite 
displacement, the work done by P is J Pr dd ; and the sum of these amounts 
of work for all the external forces is 



and again we see that the total work done by the external forces is equal to 
the gain in kinetic energy. 

Example i 

A uniform flywheel of mass 200 kg and radius 0.5 m is initially at rest and 
is made to rotate about its fixed axis by a rope which is wrapped round the 
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flywheel for three complete turns and then pulled with a steady force of 
200 N. Find: 

(z) the angular acceleration of the flywheel; 

(ii) its final angular velocity. 

(i) As the flywheel is uniform, 

m.i. = £ x 200 x (0.5 ) 2 = 25 kg m 2 . 

But the moment about its axis of the tension in the rope 
= 200 N x 0.5 m = 100 Nm. 

So the angular acceleration ft) is given by 
100 = 25ci) => ft) = 4. 

Hence the angular acceleration is 4 rad s -2 . 

(ii) As this angular acceleration is constant throughout three complete 
revolutions ( = 67 c rad), the final angular velocity ft) is such that 

co 2 = 0 2 + 2 x 4 x 6 tt = 48tt 
=> ft) = 12.3. 

Hence the final angular velocity is 12.3 rad s“ 1 . 

Alternatively, the total work done by the tension 

— 100 Nm x 67 c rad = 60071 J 

and final k.e. = | x 25 x co 2 J. 

So \ x 25ft> 2 = 600 tt 

=> co 2 = 48 te 

=> ct) = 12.3. 


Example 2 

A flywheel whose moment of inertia is 1.01 kg m 2 is rotating at approxi¬ 
mately 1 200 rev min - the value oscillating between 1.0 per cent above 
and 1.0 per cent below this figure. Calculate in joules the difference be¬ 
tween the maximum and minimum values of the kinetic energy. If the 
change from minimum to maximum occurs in one-hundredth of a second 
calculate the average power input in kilowatts during this period. 

Power to the flywheel is cut off at 1 200 rev min and the wheel is brought 
to rest by a constant couple of 80 N m: calculate how many revolutions 
the wheel makes in coming to rest. (m.e.i.) 


Maximum angular velocity =1212 rev min 


1 212 x 2n 
60 


rad s 1 


= 40.471 rad s 1 
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Minimum angular velocity = 1 188 rev min -1 

= 39 .671 rad s ” 1 . 

So difference of kinetic energy 

= J x 1.01 x {(40.47t) 2 - (39.6 ti) 2 } 

= £ X 1.01 X 8071 X 0.87T 
= 0.505 x 647T 2 « 319 J. 

Hence average power input in 0.01 s 
319 

=-= 31 900 W = 31.9 kW. 

0.01 

When power is cut, 

k.e. = | x 1.01 x (40 tt) 2 = 808tt 2 J. 

Now if the angle through which the flywheel turns before coming to rest 
is a, the work done by the braking couple is — 80a N m. 

But work done = gain in k.e. 

So —80a = -808 tt 2 

=> a = 10.1 7 i 2 rad « 15.9 rev. 


Exercise ii.iic 

1 Calculate the angular momentum and the kinetic energy of: 

(i) a uniform hoop of mass 8 kg and radius j m which is spinning in its 
own plane about its centre with angular velocity 6 rad s“ 1 ; 

(ii) a uniform disc of mass 6 kg and radius 20 cm which is spinning in its 
own plane about its centre with angular velocity 4 rad s _ 1 ; 

(iii) the same disc rotating at the same speed, but about a diameter; 

(iv) a uniform bar of mass 3 kg and length 4 m rotating about one of its 
ends with angular velocity 6 rad s“ 1 ; 

(v) a hollow cylinder of mass 100 kg and radius 40 cm, rotating about its 
axis at 100 revolutions/min; 

(vi) the earth, assuming it to be a uniform sphere of mass 6 x 10 24 kg 
and radius 6.4 x 10 6 m. 

2 A uniform flywheel has mass m and radius a, and is rotating freely 
about its axis with angular velocity w. If a constant braking force F is then 
applied tangentially to its surface, find: 

(i ) its angular deceleration; 

(ii ) the time it takes to stop; 

(Hi) the angle through which it turns. 

3 A bicycle is held with one of its wheels rotating freely with angular 

velocity 200 rev/min. The wheel has mass 2 kg and radius 25 cm, and 
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when the brakes are lightly applied it is brought to rest in three revolutions. 
Assuming that the mass of the wheel and the constant braking force are 
both concentrated at its circumference, find the magnitude of this force. 
For how long would it need to act; and if ji = 0.1, what is the total contact 
force between the brakes and the rim? 

4 A gyroscope consists of a uniform disc of mass 2 kg and diameter 
10 cm mounted on a light axle of diameter 4 mm. A string is wrapped 
round the axle 20 times and pulled sharply with a constant force. If this 
pull takes 2 s to unwrap the string, find: 

(i) the angular acceleration of the disc; 

(ii) its final angular velocity; 

(iii) the tension in the string. 

5 A rope 2 m long is wound round the axle of a flywheel and pulled with 
a constant force of 400 N. When the string is unwound the flywheel is 
rotating at 200 rev/min. Find its moment of inertia. 

6 A uniform rod of length 2 a and mass M is freely pivoted at one end and 
is initially in a vertical position above its pivot. If it then topples over, find 
its loss of potential energy by the time it is ( i) horizontal, (ii) vertically 
below the pivot; and so find its angular velocity at these two positions. 
Finally, by considering the moments of its weight, find the corresponding 
angular accelerations. 

7 A uniform cylindrical flywheel of mass 1 000 kg and radius 0.4 m is 
being accelerated by an electric motor which is working at a constant rate 
of 10 kW. Ignoring all resistances, find the angular velocity of the fly¬ 
wheel after one minute. 

8 A heavy uniform rod of mass m and length l is suspended from one end 
and oscillates freely about its position of equilibrium. Find: 

(*) the moment of its weight about the point of suspension when the rod 
makes an angle 6 with the downward vertical; 

(ii) the equation of motion when the rod is in this position; 

(iii) an approximate equation of motion when 6 is small; 

(iv) the period of small oscillations. 

9 A light thread is wrapped round a uniform cylinder of mass m and 
radius a which is free to rotate about a horizontal axis. A mass M is tied to 
the end of this string and allowed to fall from rest in a vertical line. After it 
has fallen a distance x, find: 

(i) its speed zq 

(ii) its acceleration a. 

10 A uniform circular disc, of mass m and radius a, is free to rotate about a 
frictionless horizontal axis through its centre and perpendicular to its plane. 
One end of a light string is attached to a point on the circumference of the 
disc and part of the string is wound on the circumference. The other end of 
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the string carries a particle A of mass 2 m hanging freely. The system is 
released from rest and a restoring couple of moment mgaO acts on the disc 
when 9 is the angular displacement of the disc. Write down the equation 
of motion of the disc about its centre and the equation of motion of A. 
Hence show that the tension in the vertical part of the string is § (1 + 2 0)mg. 
Show also that 

d 2 0 

5a dF = 4g ~ 2gd ' 

Deduce that the motion of A is simple harmonic and find the period of 
this motion. 

(You may assume that part of the string always remains wound on the 
disc during the motion.) (o.c.) 


11.12 Dimensions 


It will have been noticed that this introduction to mechanics has been 
based entirely upon the three fundamental concepts, or dimensions , of mass, 
length and time. If we now use the symbol [ ] to indicate the dimensions 

of a quantity, we can write 

[mass] = M, [length] = L, [time] = T. 


Furthermore, we can proceed to find the dimensions of other quantities in 
terms of M, L, T, such as 

[area] = [length x length] = L 2 , 

[volume] = [area x length] — L 3 , 




[density] 
[velocity] = 


mass 


|_volumej 
[" length”! 


I _ M _ 

'~U ~ 


ML 




[acceleration] = 


time 
f velocity! 


= \ = LT 
T 


LT 


|_ time J T 


= LT 


[force] = [mass x acceleration] = MLT 2 , 

[moment of a force] = MLT -2 x L = ML 2 T -2 , 
[impulse] = [force x time] = MLT -2 x T = MLT -1 , 
[momentum] = [mass x velocity] = MLT -1 , 

[work] = [force x distance] = ML 2 T -2 , 

[kinetic energy] = [mass x (velocity) 2 ] = ML 2 T -2 , 
f work 


[power] 


|_time 


= ML 2 T 
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It will be seen that the dimensions of impulse are identical (or consistent) 
with those of momentum, and that the dimensions of work are consistent 
with those of energy. This notion of consistency can be used to check, and 
sometimes to derive, equations in mechanics. 


Example i 


A piano wire has mass m, length /, and tension F , and a student knows that 
its period of vibration is 


. . .[ml 

either 2n \— 


or 2n 


Which is more likely to be correct? 

Since [ni] — M, [/] — L, [il = MLT “ 2 , it follows that the dimensions 
of these two possibilities are 

l[ ML \ //MLT _2 \ 

vVMLT“ 2 J or Vv ML ) 

= V T2 =T = n /T- 2 =T- 1 . 

Hence it is clear that the second possibility for a period of dimension T 
would not be consistent^ so that the likelier, (and in fact correct) formula is 
2 nJiml/F). 


Example 2 

A simple pendulum consists of a bob of mass m attached to a string of 
length l and the pendulum is swinging at a point where the acceleration 
due to gravity is g. Investigate, by means of dimensions, the way in which 
the period T depends upon m , /, and g. 

Firstly, we suppose that the period T depends jointly upon m, /, and g in 
such a way that: 

T oc m a l^g y , where a, /?, y are constants. 

But [m a /V] = M°T/(LT~ 2 ) y 
= M a L^ +y T“ 2y 
and [71 = T. 

So for dimensional consistency, 

M°l/ +1 T -2y = T 
= 0 

= 1 => y = 

= 0 => ft — 2 
To c m°/V* 


> a 
— 2 ? 
P + y 


Hence 
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It will be recognised that although this does not constitute a proofs it is 
nevertheless a highly convenient approach to such an investigation, which 
is confirmed by the results of p. 233 that T = (l/g). 


Exercise 11.12 

1 State the dimensions, in terms of M, L, T, of the following quantities: 

(z) pressure (force per unit area); 

(zz) line density; 

(in ) surface density; 

(iv) frequency; 

(u) angular velocity; 

(vi ) angular acceleration; 

(vii) angle; 

(viii ) moment of inertia; 

(ix) rate of loss of mass (e.g., of a rocket); 

(#) constant of gravitation (see 8 . 11 ); 

(xi) stiffness of a spring (see 11 . 8 ); 

(xii) surface tension (energy per unit area). 

2 Use the method of dimensions to predict how: 

(z) The tension T in a string depends upon the mass m of a particle which 
is being whirled round on its end in a circle of radius r and with speed v. 

(; ii ) The height h reached by a stone will depend upon its mass m, the 
energy E with which it is projected vertically and the acceleration g due to 
gravity. 

(Hi) The speed v of sound in a gas will depend upon its pressure j p, its 
density p and the acceleration g due to gravity. 

(iv) The frequency / of oscillation of a light spring depends upon its 
stiffness k, the mass m which it is supporting and the acceleration g due to 
gravity. 

(■ v ) The velocity v of waves in a deep liquid depends upon its density p , 
their wavelength X and the acceleration g due to gravity. 

3 Use the method of dimensions to find an expression for the thrust of 
moving air on a fixed obstacle, assuming it to be the form T — kv a p^A y 
where v is the air velocity, p its density, A the frontal area which the obstacle 
presents to it, and k is a non-dimensional constant. Explain briefly the 
nature of a dynamical argument which leads us when using absolute units 
of force to expect the experimental value of k always to be less than unity. 

Under certain conditions the compressibility c of the air can affect the 
thrust appreciably, c being defined as fractional change in volume produced 
per unit change in pressure. Find the dimensions of c and show that we can 
now obtain a dimensionally correct formula for T by replacing k by any 
polynomial in a quantity 2 defined as VyJ(cp). (m.e.i.) 



268 FURTHER VECTORS AND MECHANICS 

Miscellaneous problems 11 

1 Let O be the centre of the circumcircle of AABC whose vertices have 
position vectors a, b and c. Show that the point H, with position vector 
h = a + b + c, is the orthocentre of AABC (i.e. AH _L BC, BH T GA, 
CH JL AB) and hence show that the centroid G divides OH in the ratio 1:2. 

2 The sides BC, CA, AB of AABC are divided in the same ratio by 
P, Q, R respectively. Investigate the centroid of APQR. 

3 O is a point in the plane of a triangle ABC and AO, BO, CO meet 
BC, CA, AB respectively in points P, Q, R. Using bold letters to denote 
position vectors from O, prove 

(i) that it is possible to find scalars a, /?, y such that 
oca + Pb + yc = 0; 

(ii) that p can be expressed as 
oc _ fib + yc 

P + y a ~ P + y ’ 

(Hi) that P divides BC in the ratio y/ji ; 

(iv) Ceva’s theorem, that 

BP CQ AR 
PC X QA X RB~ L 

Finally, verify Ceva’s theorem in the three cases when O is the centroid, 
the orthocentre, the circumcentre, of AABC. 

4 If a line cuts the sides BC, CA, AB (extended if necessary) of a triangle 
ABC prove Menelaus ’ theorem , that 

BP CQ AR 
PC X QA X RB ' 

5 ABC, A'B'C' are two skew lines and AB:BC = A'B' .B'C'. Prove that 
the mid-points of AA', BB', CC' are collinear. 

6 The vertices of a tetrahedron are A, B, C, D and the points P, Q, R, S 
divide the segments AB, BC, CD, DA in the ratios/?: 1,^:1, r: 1, $: 1 where 
/>, q , r, s are all positive. Show that P, Q, R, S are coplanar if pqrs — 1 . 

Show that, if also p = r and q = s, then PQRS is a parallelogram. 

(c.s.) 

7 The base of a solid hemisphere of radius a is firmly attached to one 
plane end, also of radius a , of a solid cylinder of length /, thus forming one 
solid of revolution. The material is the same and uniform throughout the 
composite body. Find the position of its centre of gravity. 

If the body can rest in equilibrium with any point of the surface of the 
hemi-spherical portion resting on a horizontal plane, show that a = lyj 2 . 

(m.e.i.) 
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8 Prove that if the sum of the squares of two opposite edges of a tetrahedron 
is equal to the sum of the squares of another pair of opposite edges, then 
the remaining pair of opposite edges are perpendicular. 

9 OA, OB, OC are three lines through the point O and the angles BOG, 

COA, AOB are a, /?, y respectively. Calculate cos 2 0, where 0 is the angle 
between the line OA and the plane OBC. (c.s.) 

10 A ball of mass m is thrown into the air with initial velocity w, and 
during its flight it experiences a resistance from the air given by the vector 
— mcv, where v is the velocity of the ball at that instant. Write down a 
differential equation for the velocity as a function of the time, and verify 
that this equation and the initial conditions are both satisfied if 

1 - 

V — - g _|_ e ct u 

C 

where g is the vector acceleration due to gravity. 

Find an expression for r, the displacement of the ball from its initial 
position at time t\ and show that, if c is small, then this is approximately 
equal to 

r* - c{& 3 g + \t 2 u), 

where r* is the position that it would occupy in the absence of air resistance. 
« (m.e.i.) 

11 A particle of unit mass is attracted to a point O with a force —9r, 
where r is the position vector of the particle relative to O. Initially the 
particle is at a point with position vector 16t and moving with velocity 2 ^*, 
where i andj are fixed unit vectors in perpendicular directions. Verify 
that the subsequent motion is described by the equation 

r — 16* cos 3t -h 8 j sin 3 1, 

and interpret this equation geometrically. 

If there were also a resistance to motion of magnitude ten times the 
speed, obtain a differential equation for the motion and show that it has a 
solution in the form 

r = A e~ l + Be~ 9t . 

With the same initial conditions as before, find the values of .4 and B , and 
investigate the nature of the motion for t > 0 . (s.m.p.) 

12 A particle of mass m is projected with velocity V at an angle a to the 
horizontal. The air resistance is mk times the speed of the particle, where k is 
a constant, and opposes the direction of motion at all times. Prove that the 
equations of motion of the particle can be put in the form 

x + kx — 0 

y + fy + g = o 
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where x and y are respectively the horizontal and vertical distances from 
the point of projection. 

Show that 

V 

x — — cos a ( 1 — e kt ) 
k 

and find a similar expression for y. 

Sketch the path of the projectile. (m.e.i.) 

13 A steady uniform stream of air of density p and speed u strikes at right 
angles a plane surface of area A and proceeds, after the contact, with its 
effective speed still in the original direction and equal to ku (where k > 0 ). 
Derive an expression for the thrust of the air on the area, stating carefully 
the principles on which your derivation is based. 

A householder has a garage with rectangular doors each 1.20 m wide 
by 2.25 m high, turning about vertical hinges at their edges. On a windy 
day he attempts to keep one door open in such a position that the wind is 
blowing at right angles to the door. To do this he puts a brick on the ground 
at the outer corner of the door, hoping that the friction of the brick on the 
ground will be sufficient for the task. Given the data listed below, calculate 
the greatest wind-speed for which the brick will be effective: state any 
additional assumptions you find it necessary to make in the course of your 
work. 

Mass of brick = 3.0 kg; coefficient of friction = 0.4; density of air = 
1.10 kg m -3 ; k = 0.6; g = 10 ms" 2 . (m.e.i.) 

14 A shell of mass M is at rest in space, when it bursts into two fragments, 
the energy released being E. Show that the relative speed of the fragments 
after separation cannot be less than 2yJ(2E/M). 

Explain how your conclusion is affected if the shell is moving initially 
with speed U. (c.s.) 

15 A space craft of mass 10 4 kg is going round the moon in a circular 
orbit at a height of 10 6 m. Find the time of the orbit, taking g on the moon 
as 1.60 ms -2 and the diameter of the moon as 4.14 x 10 6 m. 

To escape from the moon the kinetic energy of the body has to be 
doubled. What is the magnitude and direction of the least impulse which 
can be applied to the body to enable it to escape? 

If this impulse is spread over a period of one minute, what is the apparent 
weight of a 100 kg man inside the space craft during this time? (m.e.i.) 

16 A particle of mass m is attached to the middle point of a light elastic 
string of natural length a and modulus mg. The ends of the string are 
attached to two fixed points A and B, A being at a distance 2 a vertically 
above B. Prove that the particle can rest in equilibrium at a depth baj\ 
below A and find the period of oscillation if it is displaced slightly in the 
vertical direction. 
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17 A mountaineer falls over a cliff. He is attached to a rope which, 

providentially, stretches so that he just touches the ground at the foot of 
the cliff. Find the height of the cliff and the time taken for the mountaineer 
to reach the ground (in terms of his mass, the length of the unstretched rope 
and its elastic modulus). (c.s.) 

18 Three particles A, B and C each of mass m lie at rest in a straight line 

on a smooth horizontal table, joined by equal taut strings AB, BC. A hori¬ 
zontal impulse P, is given to B in a direction perpendicular to the line of 
the strings. Describe the nature of the motion up to the instant when A 
and C meet, and calculate the loss of energy of the system if these masses 
do not separate after colliding. (m.e.i.) 

19 A pump working effectively at P kW delivers water through a nozzle 
of area A cm 2 at a speed ofms" 1 , raising the water h m in the process. 
Obtain a formula for P in terms of A, v, h and g\ assume that 1 m 3 of water 
has a mass of 10 3 kg. 

The jet is directed horizontally at right angles to a vertical blade of a 
water wheel which is initially at rest. The wheel has a moment of inertia / 
about its axis which is horizontal and the point of contact of the water is c m 
from the axle; the water falls vertically after striking the blade. 

What is the initial angular acceleration of the wheel? At what rate is 
energy being given to the wheel initially? (m.e.i.) 

20 A tank containing a liquid of density p has a hole in the bottom. The 
speed u at which the liquid escapes (averaged over the cross-section of the 
hole) is given by u = Xp Vg, where V is the volume of liquid in the tank and 
X is a constant. Find the dimensions of X in terms of mass, length and time. 

Given that the hole has cross-sectional area A, write down a differential 
equation for V as a function of t, in terms of the various constants, (s.m.p.) 

2 1 The velocity of propagation of waves of wavelength / on the surface 
of a liquid, is proportional to 



where a is the surface tension and p the density. Deduce the physical 
dimensions of <x. 

Lord Rayleigh showed that the period, T, of vibration of a small liquid 
drop, when given a slight distortion, depended only on <7, p and the radius r 
of its spherical equilibrium shape. Assuming that 

T = Ca a p p r d , 

where C , a, /? and y are dimensionless constants, find a, /J and y. (c.s.) 
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Probability distributions and 
statistics 


12. i Probability distributions and generators 

Take two coins, spin them as a pair ten times and record your results: two 
heads (HH), two tails (TT), or one of each (HT). When the author per¬ 
formed this experiment he obtained the sequence 

HT, HT, TT, HT, HH, HH, HT, HT, HH, HT, 


which can conveniently be classified according to the frequency f r with 
which the number of heads obtained is x r : 


No. of heads (x r ) 
x x — 0 
x 2 ~ 1 
x 3 = 2 


Frequency (f.) 

fi = 1 

A = 6 
A = 3 

n = 10 


In chapter 7 we called this a frequency distribution, and saw that its 
mean m is given by: 

m = ^Z/r*r = T7>(1 X 0 + 6 x 1 + 3 x 2) = 1.2 
and its variance s 2 by 
S 2 = ^f r (x r - m ) 2 

= I Z /r*r - m2 
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= -jV(l x 0 2 + 6 x l 2 + 3 x 2 2 ) - 1.2 2 
= 1.8 - 1.44 
= 0.36. 


If instead of recording frequencies, we were to tabulate the relative 
frequencies fjn = g r we obtain: 

No. of heads ( x t ) Relative frequency (g r ) 

*i=° 

*2 = 1 S2 = -fe 

*3 = 2 g 3 = ± 

Furthermore, we can see that 


m = £ Sr X r 

and r 2 = £ £r(*r - w ) 2 = - « 2 - 


So far we have made no assumptions about the coins and have not 
remarked upon any expectations which we might have had. Our task has 
been simply to record a set of events and to describe them as conveniently as 
possible. If, however, we now make assumptions about the coins and the 
way in which they are spun, we can use the probability theory of chapter 7 
to predict our expectations. Suppose, for instance, we assume that the coins 
are both completely unbiased, and they are spun with absolute fairness; 
then the probabilities p r of obtaining x r heads are clearly: 


No. of heads (x r ) 
x l = 0 
x 2 = 1 

*3 = 2 


Probability (p r ) 

Pi=\ 

P 2 ~ 2 

P3 = 4 


This set of probabilities is referred to as a probability distribution , and the 
set of assumptions (which must be expressed in precise probability terms) 
from which it arises is called the probability model 

If different assumptions were made, we would have a different proba¬ 
bility model, and so a different probability distribution. For instance if 
one of the coins was double-headed, and the other was fair and unbiased, 
then the probability distribution would be as follows: 



Pr 


Pi = 0 
P2 =i 
P3 = i 


One of the major tasks of statistics is to compare an observed frequency 
distribution with the probability distribution which arises from a par¬ 
ticular probability model, so equivalent measures to the mean and variance 
of a frequency distribution are required for a probability distribution. To 
maintain a clear distinction between the two distributions, such probability 
measures are denoted by the Greek letters //, a 2 . 
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Now m - Y,gr x r and s 2 = Z&(*r - m ) 2 > 

so n = L /v*r and = Z A(*r — M) 2 - 

As in statistics, so with probability distributions, /i is called the mean, 
or expected value of x, o 2 its variance and o its standard deviation. 

So for the probability distribution with two unbiased coins, 

fi = ^ x 0 + | x 1 + { x 2 = 1 
<j 2 = \ x l 2 + { x 0 2 + { x l 2 = 0.5 
(whereas in our trial, m — 1.2 and j 2 = 0.36). 

[It should be pointed out at this stage that in the calculation of and a, 
just as for m and s , the values of x r are not restricted to integers: see, for 
instance, Exercise 12 . 1 a, no. 8.] 

In summary: 


Statistics 

Probability 

Sr 

Relative frequency g r = — 

n 

Probability p r 

f 

Mean m = Y — 
r n 


Mean (expected value) ^ = £ prX r 

r 

Variance s 2 = < 

Z- {*r - m ) 2 

r n 

sr^Jr 2 2 

L~*r ~ 171 
{'7 n 

Variance a 2 — < 

Z Pr(*r ~ V) 2 

r 

Z Pr4 - V 2 

^ r 


Exercise 12.1a 

1 Calculate the expected value and variance of the number of heads 
showing when three unbiased coins are tossed together. Conduct a series 
of 20 such trials and calculate the mean and variance for the frequency dis¬ 
tribution that you obtain. Compare your theoretical and experimental 
results. 

2 Calculate: 

(i) the variance of the score obtained on throwing a single die; 

(ii) the variance of the total score obtained on throwing two dice; 

(Hi ) the variance of the average score on two dice. 

How are your answers to (i) and (ii) and to (ii) and (Hi) related? 
(Assume that the dice are ‘fair’.) 

3 A computer is made to produce randomly the numbers 0, 1, 2, .. ., 9. 
What is the expected value and variance of numbers so produced? 
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4 On the basis of past evidence, it is estimated that the probabilities of a 
certain type of plant having 4, 5, 6 , 7, 8 , 9 leaves are 0.13, 0.21, 0.38, 0.16, 
0.09, 0.03 respectively. Find, correct to 2 decimal places, the expected 
value and variance of the number of leaves on such a plant. 

5 Two unbiased dice are rolled and the greater score (or either if they are 

the same) is recorded. State the set of possible scores and the probabilities 
associated with these scores. Find the expected value (i.e., theoretical 
mean) of the recorded scores. (s.m.p.) 

6 A boy spins a coin until he obtains a head, but impatiently gives up if 
he does not succeed in five attempts. What are the expected value and 
variance of the number of times he spins the coin? 

7 What are the mean and variance of the number of different factors 
(other than 1 and the number itself) of an integer chosen at random in the 
range 1 to 30? 

8 A box contains 100 tokens which differ in mass, but are otherwise 
identical. 20 of the tokens have a mass of 4.8 g each, 35 a mass of 5.2 g, 
25 a mass of 5.7 g, 15 a mass of 6.5 g, and the remaining 5 have a mass of 
8.0 g each. Calculate (to 2 decimal places) the expected value and variance 
of the mass of a token chosen at random from the box. 

9 An examination question consists of two parts, A and B, and the 
probability of a pupil getting part A correct is f. If he gets A correct, the 
probability of getting B correct is f; otherwise it is There are three marks 
for a correct solution to part A, two marks for part B, and a bonus mark if 
both parts are correct. Calculate the expected value and variance of the 
pupil’s total mark for the question. 

10 In a game where the gambler rolls two dice, ona£l stake the casino 
pays out £10 for a double six, and £3 for a score of seven (the stake money 
being returned as well), and for any other score the gambler loses his 
money. What is the casino’s expected profit (to the nearest penny) on a 
£100 stake? 

n Two small piles of cards contain respectively the 1, 2, 3, 4 , 5 , 6 of 
diamonds and the 7, 8 , 9, 10 of diamonds. Verify that these sets of numbers 
have expectations 3^ and 8 ^ respectively, and that the variances are 
and 1 | respectively. 

One card is drawn at random from each of the piles and the product 
of the numbers so formed is calculated. Find the expected value and 
variance of this product. 

Find also the probability that the product from any one draw will exceed 
its expectation. (c.) 

12 A motorist drives into town and has the choice of two car parks. It 
takes him 20 minutes to drive from his home to car park A, which is never 
full, and then it takes him 15 minutes to walk to the office. If he decides 
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to try to get to car park B it takes him 25 minutes to get there from his home. 
The probability that car park B is not full is />, and in this case he can walk 
to the office in 5 minutes. If car park B is full he drives back to car park A, 
which takes 5 minutes. If he always drives first to car park B find the proba¬ 
bility distribution of the time T , in minutes, taken to get to work. Find the 
expected value of T, showing that its value is less than 35 if p exceeds §. 
Find also the variance of T. 

If however, the motorist tosses a coin in order to decide whether to go 
straight to car park A or to try car park B first, find, in this case, the proba¬ 
bility distribution of T, and the expected value of T. (c.) 


Probability generators 

Suppose that we want to find the mean and variance of the geometric 
probability distribution (see 7 . 4 ) of the number of throws required of a 
‘fair’ die until a six is obtained. 

The probability distribution is given by: 

No. of throws Probability 


x l = 

1 


Pi 

= 

1 

6 




x 2 — 

2 


Pi 

= 

1 

6 

X 

5 

6 ‘ 

__ 5 
“ 36 

x 3 = 

3 


P3 

= 

1 

6 

X 

(t: 

4 2 _ 25 

> ~ 216 

x r = 

r 


Pr 

= 

1 

6 

X 

e§: 

r 1 

Hence 

P = 

ix 

1 + 

5 

36 

X 

2 

+ 

#6*3+--- 

and a 2 = 

(£ x 

l 2 + ; 

5 

36 

X 

2 2 

+ fk X 3 2 + 


Direct calculation of such infinite sums will be lengthy, to say the least, 
and even where the number of outcomes is not infinite, calculation of ju 
and a 2 can be very tedious. But, particularly when the variables x r are 
integers, calculations can be eased considerably by using the probability 
generator (or as it is sometimes known, the probability generating function ): 

Let the outcomes which have associated random variables 0, 1,2,... 
have probabilities denoted by p 0 , p x , p 2 , . . • • Then the probability 
generator G(t) is defined by 

G(t) = X PX — Po T p\t + p 2 t 2 + • * * + p r t r -(-•••* 

In the probability generator, t is a dummy variable: it has no significance 
in itself, and any other letter would do just as well. We merely use the 
powers of t to pick out equivalent probabilities — the coefficient of f in 
G ( t ) giving the probability of obtaining the value n for the random variable. 
It should also be noted that G(t) may be a finite or infinite polynomial in t. 

It is possible also to define a probability generator when the random 
variables are negative or non-integral (see Exercise 12 . 1 b, no. 13). In such 
a case, of course, the generator will not be a polynomial, but nevertheless 
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the properties of G(t) which we will shortly derive will still hold. In practice, 
however, such probability generators are not often used. 

Clearly the probability generator provides us with a means of displaying 
a complete probability distribution in a single expression. For instance, the 
distribution for the number of heads obtained on throwing two unbiased 
coins is summarised by the generator: 

G{t ) =? + £*+ 4 1 2 . 


It is not immediately obvious how a probability generator can be used to 
evaluate jU and <r 2 , but we shall see shortly that they are calculated from the 
first and second derivatives of G(t). Meanwhile, to aid differentiation, the 
generator will need to be written in as simple a form as possible, so let us 
simplify the generator of our geometric distribution: 

G(t) = ^ + * x + * x (f) V +.- + ix (*)'“■ l f + • • • 



Now the contents of the square brackets can be recognised as a geometric 
progression whose sum is 

“1 - (5f/6)"' 


lim , 

^°°L 1 - 5*/6 J 

and since t has no significance in itself, we can choose it such that |5f/6| < 1, 
so that ( — ) —► 0 as n —> oo. 

1 


Hence G(t) = p x 


1 - 5t/6 6-5 1 


Derivation of fi, a 2 from the probability generator 

When the random variables x t , x 2 , . . . are 0, 1,2,... the formulae for 
expected value and variance take the form: 

H = Trp r a 2 = JV/v - fi 2 . 

Now G(t) = YP/ 

=> G'(t) = Yj r P/~ l => G\\)=Y J rp r 

=> G"(t) = Y.r{r- 1 )p/~ 2 => G"(l) = Ir(r - 1)A 

=> G"(l) +G'( 1 ) =Zr 2 /, r . 

Hence [i = Y, r Pr = G'{1) 

and a 2 = X r 2 /v - p 2 = G"{ 1) + G'{ 1) - [G'(l)] 2 . 


P = G'(l) 

a 2 = G"(l) + G'(l) - [G'(l )] 2 


Summarising: 
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Returning now to our example, 


G(t) 
=> G'(t) 


=> G"(t) 




t 

6-5 1 
6 — 5t + 5t 
(6 - 50 2 
6 

(6 - 5 1) 2 ^ 
60 

(6 - 5 If ^ 

6 and a 2 = 


G'(l) = 6 

G"(l) = 60 
60 + 6 - 6 2 


= 30. 


Hence the expected number of throws required to obtain a six is 6 , and 
the standard deviation of the number of throws is ^30. 


Binomial distribution 

We saw in 7.4 that if the chance of a particular event occurring in a single 
trial is/?, then in n such independent trials, the probabilities of 0, 1,2,3,. . ., n 
occurrences are given by the terms of the binomial expansion 

(q + /?)", where q = 1 — p. 

These probabilities, therefore, are also the coefficients of successive 
powers of t in the expansion of 

G(t) = (q + pt) n 


which is therefore the probability generator of the binomial distribution. 

Now G(t) = (q + pt) n 

=> G\t) = np{q + pt) n ~ l 

=> ji = G'(l) = np{q + p) n ~ l = np , since q + p = 1 . 

Furthermore G”(t) = n(n — 1 )p 2 (q + pt) n ~ 2 

=> G"'(l) — n(n — 1 )p 2 

=> cr 2 = G”{ 1) + G'( 1) - [G'(l )] 2 

= n 2 p 2 — np 2 + np — n 2 p 2 
= np{ 1 - p) 

= npq. 


Hence 


li = np (7 = yfnpq 
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The result that, for a binomial distribution, 

= (? + ^)" = (? + ^0 (? + />*) • • • (q + pt), 

where q + pt is the generator of a single trial, illustrates the fact that for a 
probability distribution arising from a series of independent (and not neces¬ 
sarily identical) trials, the overall probability generator is the product of 
the generators of each of the separate trials. 

Example 

In a certain large city, it is known that ^ of the voters support the radical 
party. In a sample of 12 voters, what is the expected value and standard 
deviation of the number of radicals? 

We have therefore a binomial probability situation with n = 12, p = ^ 
and q = §. 

So n = n p = 12 x } = 4 

and o = yfnpq = ^(12 x j x |) = ^/f % 1.63. 

Exercise 12 . 1 b 

1 Calculate the probability generator for the number of tosses of an 
unbiased coin needed to obtain the first head, and hence evaluate the 
mean and variance of the number of tosses. 

2 Calculate the expected value and standard deviation of the number of 
heads obtained when an unbiased coin is tossed (z) 4, (ii) 36, (Hi) 100 times. 

3 In the next general election suppose that 36% of the electors intend to 
vote Liberal. Find, in terms of n, the mean and standard deviation of the 
percentage of those who intend to vote Liberal in samples of size n . 

(s.M.P.) 

4 Six players take it in turns to cut a pack of cards (excluding jokers), and 
each time the cards are replaced before the next player cuts. Calculate the 
expected value and standard deviation of the total numbers of spades 
turned up when each player has cut. 

5 The probability of a pupil arriving at school late on any given day is . 
What is the probability of his being punctual for a whole week (i.e., 

5 school days) ? Calculate the mean and variance of the number of days he 
will be late in a school term consisting of 14 weeks (i.e., 70 days). Also cal¬ 
culate the expected number of completely punctual weeks in the term. 

6 Seedlings are planted in 10 rows of six each. The probability of a seedling 
dying before it flowers is ^. Calculate the mean and variance of the number 
of rows in which all the seedlings flower. 

7 Eight per cent of eggs sold by a certain grocer are brown. 

(i ) Calculate the mean and variance of the number of brown eggs obtained 
in a carton of 12 . 
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(ii) By finding the probability of a carton containing no brown eggs, and 
using a suitable probability generator, find how many cartons a housewife 
can expect to buy before she finds her first brown egg. 

8 Prove that for a binomial probability distribution arising from n trials, 
the maximum possible value of the variance is /z/4. 

9 An electronics firm packs the resistors that it produces in boxes of 800. 
On average one component in 100 is faulty. Calculate the expected number 
of defective resistors in a box, and the variance of this number. How could 
you quickly obtain a good approximation to the variance in this case? 
What would the percentage error (of the actual value) be if this approxima¬ 
tion were used? 

10 A box contains a large number of screws. The screws are very similar 
in appearance, but are in fact of 3 different types, A, B and C, which are 
present in equal numbers. For a given job only screws of type A are suitable. 
If 4 screws are chosen at random, find the probability that: 

(i) exactly two are suitable; 

(ii) at least two are suitable. 

If 20 screws are chosen at random, find the expected value and variance of 
the number of suitable screws. (c.) 

11 In a machine game of chance, when a lever is pulled, one of the 

numbers 1, 2, 3 appears in a window. The lever is pulled 5 times and the 
total score is recorded. If the probabilities associated with the numbers 
1 , 2 , 3 are e ? i? 2 respectively, write down an expression for the probability 
generator G(t) for the possible total scores. Evaluate G( — 1) and, hence or 
otherwise, find the probability that the total score is even. (s.m.p.) 

12 By using the fact that the probability generator for the number of 
throws of a die required to obtain a six is t /(6 — 5t), or otherwise, obtain 
the probability generator for the number of throws required to obtain two 
sixes (not necessarily consecutively). Hence calculate the expected value 
and variance of the number of throws required to obtain two sixes. 

13 A multiple-choice examination paper consists of 25 questions, each 
with 5 possible answers, only one of which is correct. A student gains 4 marks 
for a correct answer, and loses one mark for an incorrect answer. 

(i ) An entirely ignorant student has to guess the answer for each question. 
Find the probability generator for the number of marks he obtains on the 
paper, and hence calculate his expected total mark, and the standard 
deviation of that mark. (Hint: first find the probability generator for the 
marks on a single question.) 

(ii) A rather more intelligent student has twice as much chance of choosing 
the correct answers as he has of choosing any one of the wrong answers; 
find the expected value and standard deviation of his total score on the 
paper. 





12.2 CONTINUOUS PROBABILITY DISTRIBUTIONS 281 


14 A petrol company gives away one medal with every purchase of 
petrol. Each medal is equally likely to be any one of a set of n medals. At a 
stage when a customer already has s different medals, show that the 
expected number of purchases to get the first medal different from those 
already held is n/{n — s). Deduce that the expected number of purchases 
for a complete set is 



1 1 
+ 2 + 3 



(c.) 


12.2 Continuous probability distributions 

Statement i : The probability of throwing a 5 with a fair die is j-. 

Statement 2 : The probability of a voter, chosen at random in London, being 
a Liberal is ^. 

Statement j: The probability of the new machine turning out a component 
of length 241 mm is 

Superficially the three statements above seem very similar. They all 
make predictions about what may occur if we conduct certain trials, and 
we presume that there is some foundation, theoretical or experimental, for 
each. But let us examine the statements separately, and in a little more 
detail. 

Statement / is presumably based on theoretical considerations (themselves 
supported by past experience) and uses the notion of‘equal likelihood’. 
It tells us that if we conduct a large number of trials, about £ of the outcomes 
will be 5’s; and that, in general, the larger the number of trials, the closer 
we get to this fraction. 

Statement 2 can only be based on statistical evidence, and tells us that in a 
large, randomly chosen group of voters, we could expect about £ to be 
Liberals. 

Both of these statements are concerned with discrete (i.e., definite and 
distinct) outcomes: when a die is thrown there are only six distinct results 
possible, and similarly a voter has political views which can be expressed 
in certain distinct choices (and we would regard ‘ don’t know’ as a category 
of choice). 

Statement 3 however, is concerned with length, a continuous quantity, and 
presumably tells us that if, for instance, we look at 150 components pro¬ 
duced by the machine, about 25 will be 241 mm in length. But if we take 
these 150 components, and measure each by a micrometer to 0.001 mm, 
then we would certainly not expect 25 of them to be exactly 241 mm in 
length, and the statement in its present form is meaningless. Perhaps it is 
more likely to mean that if measurements are taken to the nearest millimetre , 
then the probability of a length of 241 mm is £? 
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So although we can never be absolutely certain about the exact length of 
any object, and must say that 

(length — 241 mm) = 0, 

nevertheless our original statement can meaningfully be written as: 
p{ 240.5 ^ length ^ 241.5) = 

Indeed for any continuous quantity, we cannot talk (as we did for 
discrete quantities) about the probability of it attaining a specific value. 
We can only meaningfully talk about the probability of length, time, 
etc., lying within a certain range of values. 


Cumulative probability 

Even though, for a continuous variable like length, we cannot use 
p (length = 241 mm), it would clearly be valuable if we could associate 
some form of probability with any given length. This is done by use of the 
notion of cumulative probability. 

The cumulative probability of length 241 is defined to be the probability 
that the length is less than (or equal to) 241, and is written 

0(241) = jfr(length < 241). 

More generally, the cumulative probability function f for any variable x is 
defined by 

O(X) = p(x ^ X), 

so that 0 ^ O(X) ^ 1 for all X. 

(It should be noted that we can also meaningfully talk about the cumulative 
probability of a discrete variable; and for instance if we throw a single die, 
0(5) = f. But clearly this notion is less useful, and is less frequently used.) 


Example 1 

In a race, the times of the competitors are measured by stop-watch, to the 
nearest jo of a second. What is the cumulative probability function for 
error in measured time? 

Let the error be denoted by x, which is uniformly distributed over the 
interval —0.05 to 0.05. 

Therefore <D(-0.05) = 0 and <D(X) = 0 for any X ^ -0.05 
<[>(0.05) = 1 and ®(X) = 1 for any X ^ 0.05 

and <D(X) increases steadily in value from 0 to 1 over the interval 


f Also called a distribution function. 




12.2 CONTINUOUS PROBABILITY DISTRIBUTIONS 283 

-0.05 < X ^ 0.05. 

So O(Z) = 10(Z + 0.05), if -0.05 ^ Z < 0.05. 



Example 2 

Particles are emitted uniformly from a radioactive source in the corner of a 
room. A screen (of theoretically infinite length) is positioned one metre 
away from the source, so that all particles must strike the screen. What is the 
cumulative probability function of the distance of the point of impact from 
the bottom of the screen? 



Since the particles are emitted uniformly over the complete arc of jn 
radians, 

®(Z) = p (particle striking AB) 

= fraction of total angle \n subtended by AB 


6 


-nK 


29 

71 


X 


But since tan 6 = — = X 


then 


6 = tan -1 Z. 


Hence ®(Z) = — tan 1 Z 

71 

in the interval Z ^ 0. 
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Probability density function 

The cumulative probability function, however, does have the disadvantage 
of often being cumbersome to use in practice. From its graph, for instance, 
we cannot easily picture the overall probability distribution, and we nor¬ 
mally use instead the j Probability density function (p.d.f.), derived as follows: 


By definition of O(X), we know that 
p(a ^ x ^ b) = <D (b) — O(fl). 

Hence p{X < x ^ X + SX) = 0(X + SX) - <b(X) 

d>(X + 5X) - O(X) sv 
=- 5Z-** 

Now as <5 X -> 0, we know that + ^ ~ -> V(X). 


Hence p(X < x < X + SX) « <V(X) SX. 


Clearly the function <D' (x) is very useful, since it enables us to find the 
probability of x lying in any small interval; and since the probability is 
equal to this function multiplied by the width of the interval, the function 
is referred to as the probability density function, and is rewritten </>(x). 


Since </>(x) = <&'(*), it follows that 


<D(X) = 


4>(x) dx , where L is the least possible value of x. 


Furthermore, the probability that x lies in the interval [ 0 , b~\ is clearly 
seen to be 


*b 

4 >(x) dx 

a 


and is therefore represented by the area shaded on the diagram. 



It therefore follows, since <D(x) is an increasing function and <j>(x) is its 
derivative, that p(x) ^ 0; and, since the total probability must be equal 
to 1, that 
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<j)(x) dx = 1, 


where L and G are the least and greatest possible values of x . 


Mechanical analogies 

* There are several analogies to probability density, and perhaps the two 
most obvious are velocity and mass density: 

(i) Probability is calculated from probability density in exactly the same 
way as distance is calculated from velocity. For if t denotes time and v(t) is 
the velocity at time then \ b a v(t) dt is the distance travelled between 
times a and £, just as J fl (j){x) d* is the probability of x lying between a and b ; 
and these integrals are represented as shaded areas under their respective 
graphs: 



(ii) Similarly, just as density measures the rate of build-up of mass, so 
probability density measures the rate of build-up of probability. Suppose 
we have a metal rod of non-uniform material which has mass 1 kg, length 
2 m and constant cross-section (so that we can regard the density as mass 
per unit length). Then if the density is graphed as 



we see that most of the mass is concentrated to the right of the rod’s mid¬ 
point, just as in the equivalent probability distribution: 
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Returning to Example i, 

®(*) = 10(* + 0.05) for —0.05 < x ^ 0.05 
then <j)(x) = ®'(x) 



for -0.05 ^ a ^ 0.05 
elsewhere. 



This continuous probability distribution is therefore analogous to the 
discrete rectangular distribution discussed in chapter 7 . 


Similarly, in Example 2 , 


x 2 -1 
<P(x) = — tan * 

n 

for x ^ 0 . 

So <f>(x) = 


r 2 

for x ^ 0 

= J 7l(l + X 2 ) 

{ 0 

for x < 0 . 



Example 3 

For a circular disc of radius 4 cm, what is the probability density function 
of the distance (r) from the centre of a point chosen at random on the disc? 
Use the p.d.f. to calculate the probability that a point lies between 2 and 
3 cm from the centre of the disc. 
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In this case it is easier first of all to calculate <X>(r). 

Now (D(r) = (distance from centre ^ r ) 

= fraction of total area whose distance < r 

9 

nr 


287 


n x 4 2 

jz 


16 


<t>(r) = <V{r) = - 


f 4 f 4 r r 2 4 

and (p(r) dr = \ -dr = — = 1 , as required. 

Jo Jo ° L 16 Jo 


Now, using (j)(r ), 


* J<r<3) -£!i d '-[££ = *• 


Exercise 12.2a 

1 A p.d.f. is given by 
fx + 1 


</>(*) = 


for 0 ^ x ^ a 

for x < 0 and x > a. 


By using the property of the p.d.f. that J 0(x) dx = 1, find the value of a. 
Obtain an expression for the cumulative probability function. 

Calculate: (i) p(x < ^); (ii) p(x > 1). 

2 Verify that 


4>{x) = 


r for x ^ 1 
x 

0 


is a p.d.f. 

for x < 1 . 

Find the value of X such that p(x < X) = 0.5. 
Calculate: (i) p(x > 10); (ii) p (2 < x < 4). 

3 A p.d.f. is given by 

f j-x for 0 ^ x < 3 

0(x) = j |(4 — x) for 3 ^ x ^ 4 

lo for x < 0 and x > 4. 


Sketch the graph of 0(x). 

Calculate: 

(0 the probability that x occurs in the interval [ 1 , 2 ]; 
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(ii) the probability that x > 2. 

Obtain the cumulative probability function, and hence, or otherwise, 
find the median of the distribution. 

4 The p.d.f. of a distribution is given by <j>(x) = k( 1 — x 2 ) for — 1 ^ x ^ 1 , 
and <j>(x) =0 elsewhere. 

Find the value of A;, and hence calculate: 

(*) P(~i ^ * < 2 ); («) p(x > !)• 

Obtain an expression for the cumulative probability function of the dis¬ 
tribution. 

5 The p.d.f. of a distribution is given by <j)(x) = a sin nx for 0 ^ x < 1 , 
and 4>(x) =0 elsewhere. 

Find the value of a , and obtain an expression for the cumulative proba¬ 
bility function. 

Calculate: (i) p(x < j); (ii) p(j < x < f). 

6 The probability that a transistor in a radio lasts less than t hours is 
1 — e ~ t/2 00°. Find the p.d.f. for the lifetime of a transistor. 

(■ i ) What is the probability that a transistor lasts more than 4 000 hours? 
(ii) What is the probability that a transistor ceases to function after 2 000 
hours of use but before 3 000 hours. 

(Hi) If a radio contains 8 transistors, what is the probability that none of 
them fails before 1 000 hours of use? 


Parameters of a continuous distribution 


In a discrete probability distribution, the expected value and variance 
were defined by 


M = E X rPr and <T 2 = E ( x r ~ tfpr 

r r • 

= E X rPr ~ H 2 - 


In a continuous distribution, the probability corresponding to p r is given 
by (f> (x) Sx. 

So fi = lim Yj x<j>(x) Sx and a 2 = lim Y (x — fi) 2 4>(x) Sx 

= lim Y x 2 (f)(x) Sx — ix 2 




-r- 


'(j)(x) dx 


and 


" 2 - f ( * ■ 


fi) 2 (j)(x) dx 


*G 

x 2 (j)(x) d X — }JL 2 . 
L 


Example 4 

The p.d.f. of the lifetime (t), in minutes, of atoms of a radioactive element 
is given by <f>(t) = 3 e ~ 3t . Find the expected lifetime of an atom, its variance, 
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and the half-life of the element (the time it takes for half of the material to 
decay). 

The expected lifetime, /t, is given by 


M = 


3t e~ 3t d t. 


Integrating by parts, 


~ 3 1 


d t 


= 0 + 


- 3 1 


= -a mm. 


and 


-r 


3 1 2 e’ 3t d t-i 


= [-t 2 e~ 3 % + 


2 te~ 3t d t-i 


i i 
9 


= 0 + f x | — 

= i min 2 . 


Now if T is the half-life, then 

t%T 


r 


3 e~ 3t d t = j 


1 -e~ 3r = j 

,~3T _ 1 


3T = In 2 

T = | In 2 « 0.23 min. 


Exercise 12.2b 


1-5 Calculate the mean and variance of each of the probability distri¬ 
butions defined in Exercise 12 . 2 a nos. 1-5. 


6 A random variable x has cumulative probability function 


fo 


®(*) = i 



u 


(x ^ a) 

(a < x < b ). 
(x ^ b) 


Find the p.d.f. (j>(x), and sketch the graph of </)(x). Obtain the mean and 
variance of x. (c.) 
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7 The probability density function, <j)(x) 9 of a random variable x 9 is 
given by 

4>(x) = a# (4 — x) if 0 < x < 4. 

= 0 otherwise. 

Find the value of a, and hence find the mean and variance of x. What is 
the probability that x lies between 0 and 1 ? (a.e.b.) 

8 A random variable x has the cumulative probability function 

ro (x < o) 

®(*) = \ kx 3 (0 ^ x ^ 2) 
ll (x > 2) 


where A: is a constant. Find the mean, median, and variance of x. 

(m.e.i.) 

9 A probability distribution has the probability density function 


4>{x) 


|0 X < 2 

x> 2, k constant. 


Find, in terms of A, the mean, median and standard deviation of the 
distribution. (m.e.i.) 

10 The probability that a light bulb lasts longer than t hours is 
Find the probability density function for the lifetime of a bulb. 

Show that the mean lifetime is fi. 

If the mean lifetime is 1 500 hours, how unlikely is it that the bulb will 
last more than 3 000 hours? 

If the manufacturer wants to ensure that less than one in a thousand 
bulbs fail before 5 hours, what is the lowest mean lifetime he can allow his 
bulbs to have? (s.m.p.) 


11 A random variable x has the probability distribution 

4>{x) dx — C dx ( — 2a ^ x ^ —a, a ^ x ^ 2a) 

4>(x) dx = 0 elsewhere. 

Sketch the distribution. Obtain the value of the constant C, and the 
standard deviation,, er, of the distribution. 

Find the probabilities of obtaining values of x 

( i ) within one standard deviation of the mean; 

(ii) within two standard deviations of the mean. 

Find the value of k such that /?(|a:| < ha) = 0.95. (m.e.i.) 

12 A mathematical model for the fraction x of the sky covered with cloud 
(0 < x < 1 ) assigns to this a p.d.f. 

^ x) = 7(Tr?)' 
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Calculate: 

(i) the value of k ; 

(ii) the expected fraction covered by cloud; 

(Hi) the probability that not more than j of the sky is covered. 

[ Hint : Your integrations may be made easier by using the substitution 
x = sin 2 6. You may assume that this substitution is valid, even though the 
function to be integrated may be discontinuous at the ends of the interval 
of integration.] (s.m.p.) 

13 The probability density function of a distribution is given by 
6 X ^ 

<t>{x) = ^ —Pjl (x > 0, A integer > 0). 

Find the expected value and variance of x. 

Sketch (fr(x) when X — 2. (m.e.i.) 


Histograms 

In section 12.1 we saw how the relative frequency g r corresponded to the 
probability p r . Is it possible similarly therefore to find some statistical 
equivalent to the probability density (j) (x) ? We shall see that such a measure 
arises from the use of what is known as a histogram to display data. 

Consider the data shown below which gives the time, in seconds, between 
succeeding vehicles passing a given point on a main road, during a traffic 
survey. (Measurements were taken to the nearest second, so that a measure¬ 
ment registered as one second could actually be anywhere in the range 0.5 
to 1.5 seconds. All the time intervals in the table are given in this way.) 


Time interval 

No. of vehicles (f r ) 

Rel. frequency (g r ] 

0.5-1.5 

11 

0.11 

1.5-2.5 

19 

0.19 

2.5-3.5 

23 

0.23 

3.5-4.5 

14 

0.14 

4.5-5.5 

10 

0.10 

5.5-10.5 

15 

0.15 

10.5-20.5 

8 

0.08 


Too 

1.00 


These results can be displayed in a relative frequency diagram: 




But it is immediately clear that this diagram gives a very misleading 
impression, and in order to correct this we require a diagram in which 
areas of the successive rectangles represent the corresponding relative 
frequencies. We therefore divide each relative frequency by the width of 
the corresponding interval. The resulting quantity is sometimes known as 
the relative frequency density , and the corresponding figure (whose total area 
is equal to 1) is a histogram : 



As a further extension of the traffic survey, 1 000 times between successive 
vehicles were measured to a greater accuracy, this time correct to 0.1 s, 
and the resulting histogram was: 
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The general form of this histogram is obviously similar to that of its pre¬ 
decessor; but, because the time intervals are now much smaller, the out¬ 
line of the histogram is correspondingly smoother. It is not difficult to see 
that as the time intervals become still smaller, the outline of the histogram 
will tend to a curve, which is therefore the statistical equivalent of the graph 
of the probability density function. 


Exercise 12.2c 

1 Conduct a traffic survey similar to that described in the text on a local 
main road, and record your results on a histogram. 

2 The lifetimes (in completed weeks) of 100 mice of a certain species are 
given below. 


Life time/weeks 

0-9 

10-39 40-49 50-59 60-69 

70-79 80-99 

No. of mice 

8 

3 7 18 31 

22 11 


Represent the data on a histogram. 

3 Each of a group of 50 schoolboys attempts to throw a javelin for the 
first time. The distances they achieve (to the nearest metre) are given below. 


Distance/m 

0-19 

20-29 

30-34 

35-39 

40-49 

No. of throws 

5 

11 

18 

13 

3 


Represent the data on a histogram. 

4 120 people set out on a sponsored walk to raise money for charity. The 

distances they covered are given below. 


Distance/km 

less than 10 

10-15 

15-20 

20-30 

30-50 

No. of walkers 

12 

15 

25 

41 

27 


Represent the data on a histogram. 

5 From a sample of 100 marriages taking place at a registry office, the 
ages of the husbands are recorded below. 


Age 

16-20 

21-25 

26-30 

31-40 

41-60 

No. of husbands 

10 

38 

35 

13 

4 


Represent the data on a histogram. 

12.3 The Normal distribution 

In the last section we derived methods of using probability models to 
describe continuous data. We now look in some detail at the most important 
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(though by no means the only) continuous probability distribution. We 
shall first of all see, by looking at sets of statistical data, how it most fre¬ 
quently arises in practice. The data given in the following three examples 
is in each case continuous and each set of data is illustrated by its corres¬ 
ponding histogram. 


/. Masses offive-penny pieces 

100 five-penny pieces were weighed and their masses were classified into 
intervals of 0.05 g. The resulting frequency table, with corresponding rela¬ 
tive frequencies and relative frequency densities was found to be as follows: 


Mass/g 

Frequency 

Rel. freq. 

r.f.d. 

5.45-5.50 

i 

0.01 

0.2 

5.50-5.55 

3 

0.03 

0.6 

5.55-5.60 

4 

0.04 

0.8 

5.60-5.65 

16 

0.16 

3.2 

5.65-5.70 

37 

0.37 

7.4 

5.70-5.75 

27 

0.27 

5.4 

5.75-5.80 

10 

0.10 

2.0 

5.80-5.85 

1 

0.01 

0.2 

5.85-5.90 

1 

0.01 

0.2 

Total 

100 

1.00 



, r.f.d. 



5-90 
mass /g 


2 . Lifetimes of electric light bulbs 

The lifetimes of 200 electric light bulbs were classified into intervals of 200 
hours, as follows: 
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Lifetime/h 

Frequency 

Rel. freq. 

r.f.d. x 10 3 

800-1 000 

2 

0.010 

0.05 

1 000-1 200 

7 

0.035 

0.175 

1 200-1 400 

17 

0.085 

0.425 

1 400-1 600 

43 

0.215 

1.075 

1 600-1 800 

78 

0.390 

1.950 

1 800-2 000 

34 

0.170 

0.850 

2 000-2 200 

15 

0.075 

0.375 

2 200-2 400 

4 

0.020 

0.100 

Total 

200 

1.000 




3. Heights of adult males 

The heights of 1 000 adult males were measured and classified into intervals 
of 2 cm: 


Height/cm 

Frequency 

Height/cm 

Frequency 

Height/cm 

Frequency 

150-152 

2 

166-168 

101 

182-184 

23 

152-154 

3 

168-170 

116 

184-186 

16 

154-156 

5 

170-172 

125 

186-188 

7 

156-158 

9 

172-174 

112 

188-190 

2 

158-160 

16 

174-176 

102 

190-192 

2 

160-162 

29 

176-178 

93 

192-194 

1 

162-164 

55 

178-180 

61 

194-196 

0 

164-166 

79 

180-182 

40 

t 

196-198 

1 
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From this table it is again easy to calculate the corresponding relative fre¬ 
quencies and relative frequency densities, which can be plotted as follows: 



The three histograms appear to have the same general shape; they are 
approximately symmetrical, with most of their area concentrated in a 
central region, but tailing off fairly rapidly at each end. In the third 
example, where the total frequency and the number of class intervals are 
both greater than in the other two examples, the symmetry is more marked 
and the outline of the histogram could be approximated by a ‘ bell-shaped 5 
curve. The histogram is reproduced below with such a curve superimposed, 
and similar (though less closely-fitting) curves are also superimposed on 
the other two histograms. 

1. masses: 2. times: 3. heights 




Though all have the same general bell-shape, these three curves do differ 
in certain ways: their axes of symmetry are in different positions, and one is 
‘short and wide’ whilst another is ‘tall and thin’. Such differences, of 
course, follow from the differences in position and spread (however they 
may be measured — though usually by m and s) of the original sets of 
statistical data. In fact these three curves which we have superimposed 
on the histograms were carefully chosen and are defined by a single func¬ 
tion involving two parameters, their differences being due solely to differen¬ 
ces of these parameters. This function is known as the Normal probability 
function , and satisfies all the conditions for a probability density function. 
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(The term Gaussian function is sometimes also used, after one of its earliest 
and most famous investigators, and Gauss himself called it the error function, 
from the way in which it originally arose.) 

Continuous data which has a Normal p.d.f. is said to be Normally dis¬ 
tributed. It must be strongly emphasised, however, that in many cases data 
are only approximately described by the Normal function. In our second set 
of data, for example, the lifetimes of electric light-bulbs, though having an 
approximately bell-shaped distribution are not exactly Normal in their 
distribution. Even so, this should not be regarded in any way as 4 abnormal’, 
for although the Normal distribution is extremely important, it is far from 
being universal. Indeed, perfect ‘normality’ is highly exceptional! 

The equation of the Normal function will be derived later in the section, 
when further properties of the Normal distribution have been discussed. 
But for the present we will simply quote its probability density function as: 


4>{x) = 


1 -(*-/<) 2 / 2» 2 
y/{2n)o 


The parameters fi and o can be shown, as might be suspected, to be the 
mean and standard deviation of the distribution. 



Properties of Normal probability curve 

1 The curve is symmetrical about the line x = fi. 

2 The curve is asymptotic to the *-axis, i.e., </> (x) —► 0 as x —> +00. 

3 The total area beneath the curve equal to 1 . This is, of course, a neces¬ 
sary condition for (j)(x ) to be a p.d.f., and also accounts for the value of the 
constant l/yj( 2 n)a in the equation for </>(x). 

4 Just over 68% of the area beneath the curve lies between pi — c r and 
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ju -T < 7 , so that approximately § of the data lies within one standard devia¬ 
tion of the mean. 

Similarly 95^% of the area lies between fi — 2a and fi + 2a , 
and 99f% lies between ji — 3a and /i + 3a. 


The standard Normal function 

We now develop a single method of dealing with all Normal probability 
distributions, irrespective of the values of their parameters and a. This is 
done by standardising the variable, that is transforming it so that its new 
mean and standard deviation become respectively 0 and 1. 


Example 1 

The marks in a mathematics examination are found to be approximately 
Normally distributed with mean 56 and standard deviation 18. Standardise 
the marks and find the standardised equivalent to a mark of 70. 

(Examination marks, of course, are discrete, rather than continuous, 
variables; nevertheless, as we shall see in later examples, a continuous 
probability curve can give us a good approximation to a discrete statistical 
distribution.) 

Let us denote the examination marks by the variable 1 We first reduce 
the mean to zero by subtracting 56 from each mark. So x becomes x — 56. 

As the mean of our new variable, x — 56, is now at the origin, and 
standard deviation measures spread from the mean, we reduce the standard 
deviation to 1 by dividing each new mark by 18. So x — 56 becomes 
(x — 56)/18. Therefore (x — 56)/l8 is now a standardised Normal variable. 

So a mark of 70 becomes -—-= 0.78. 

18 

If we now look at our Normal probability curve, in geometrical terms the 
standardisation process can be regarded as two successive transformations: 

(i) a translation with vector 



(ii) a two-way stretch with factor \/a parallel to the x-axis and factor a 
parallel to the #-axis (so that the total area under the curve remains 
constant). 
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Hence the general standardised variable now becomes (* — /i)/cr. 

Writing the standardised variable (x ~ fi)/a as t , the standard Normal 
function is therefore given by: 

Since the p.d.f., is not integrable, in practice we use the equivalent 
cumulative probability function ®(f). 

In geometrical terms, <D(f) represents the shaded area on the diagram: 
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Values of®(7) are tabulated below: 


t 


t 


t 

o(o 

t 

0(0 

0.0 

0.500 0 

1.0 

0.841 3 

2.0 

0.977 2 

3.0 

0.998 65 

0.1 

0.539 8 

1.1 

0.864 3 

2.1 

0.982 1 

3.1 

0.999 03 

0.2 

0.579 3 

1.2 

0.884 9 

2.2 

0.986 1 

3.2 

0.999 31 

0.3 

0.617 9 

1.3 

0.903 2 

2.3 

0.989 3 

3.3 

0.999 52 

0.4 

0.655 4 

1.4 

0.919 2 

2.4 

0.991 8 

3.4 

0.999 66 

0.5 

0.691 5 

1.5 

0.933 2 

2.5 

0.993 8 

3.5 

0.999 77 

0.6 

0.725 7 

1.6 

0.945 2 

2.6 

0.995 3 

3.6 

0.999 84 

0.7 

0.758 0 

1.7 

0.955 4 

2.7 

0.996 5 

3.7 

0.999 89 

0.8 

0.788 1 

1.8 

0.964 1 

2.8 

0.997 4 

3.8 

0.999 93 

0.9 

0.815 9 

1.9 

0.971 3 

2.9 

0.998 1 

3.9 

0.999 95 

1.0 

0.841 3 

2.0 

0.977 2 

3.0 

0.998 6 

4.0 

0.999 97 


Two points should be noted about the use of such a table of cumulative 
probabilities: 

(i) Intermediate values 

More accurate statistical tables are generally available, but otherwise 
linear interpolation must be used to evaluate ®(£) for values of t not given 
in the table. However, the values of ®(7) so obtained should be regarded 
as accurate only to 3 decimal places for values of t ^ 3, e.g., 

®(1.37) » 0(1.3) + ii)[®(1.4) - 0(1.3)] 

« 0.903 2 + ^ x 0.016 0 
« 0.903 2 + 0.011 2 
~ 0.914, to 3 d.p. 

(ii) Negative values 

No negative values of t are given in the table since O(— t) can be calculated 
very easily from 0(/). 
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By symmetry, the two shaded areas on the diagram are equal. The areas 
represent 0( — t) and 1 — O(J), so it follows that: 

O(-0 = i - 0(0- 

So, for instance, 0(— 0 . 6 ) = 1 — 0(0.6) 

= 1 - 0.725 7 
= 0.274 3. 

The following two examples illustrate practical applications of the 
Normal distribution: 

Example 2 

Assuming that intelligence quotients are Normally distributed with mean 
100 and standard deviation 15, calculate: 

(i) the probability that a person chosen at random has an I.Q. between 
88 and 118; 

(it) the percentage of the population with I.Q. greater than 130. 

To standardise the variable in this case, we subtract 100 and divide by 15. 
Therefore: 

(i) p{88 < I.Q. < 118) 

= p{ I.Q. < 118) - p{ I.Q. < 88 ) 

118 - 100 \ /88 - 100 
15 j ~ \ 15 

= 0(1.2) - 0( —0.8) 

= 0(1.2) - [1 - 0(0.8)] 

= 0 ( 1 . 2 ) + 0 ( 0 . 8 ) - 1 
= 0.884 9 + 0.788 1 - 1 
= 0.673; 

(ii) p{l. Q. > 130) 

= 1 - p{ I.Q. < 130) 

130 - 100\ 

15 ) 

= 1 - 0 ( 2 ) 

= 1 - 0.977 2 

= 0.022 8 . 

So approximately 2.3% of the population have I.Q. greater than 130. 
Example 3 

Machine components have lengths which are Normally distributed with 
mean 57.2 mm. It is found that 13% of the components have lengths greater 
then 57.4 mm. 
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(i) What is the standard deviation of the component lengths? 

(ii) What length will be exceeded by 20% of the components? 

(i) We will denote the standard deviation by cr. 


Now />(length < 57.4) 

- 



1 — /?(length > 57.4) 
1 - 0.13 

0.87. 


But from the Normal probability table, 0(1.13) = 0.87. 

0.2 

So —- = 1.13 

a 

0.2 

=> cr = — = 0.18 mm (correct to 2 d.p.). 


(ii) Let the required length be denoted by X 

then />(length > X) = 0.2 

=> ^(length < X) = 0.8 

(X - 57.2\ 

x 0 . 8 . 

0(0.84) = 0.8 


x 0.18 = 0.15 
X — 57.35 mm. 


V 0.18 ) 

But (correct to 2 d.p.) 
X - 57.2 

=> ——— = 0.84 
0.18 

=> X - 57.2 = 0.84 


Exercise 12.3a 

1 The mean weight of 100 sixth-formers at a certain school is 63 kg and 
the standard deviation is 7 kg. Assuming that the weights are approxi¬ 
mately normally distributed, estimate how many sixth-formers weigh: 

(i) more than 70 kg; (ii) less than 73 kg; (in) less than 60 kg; (iv) between 
61 and 65 kg. 

2 The times taken by a large group of students to complete a project are 
approximately normally distributed, with mean 30 hours and standard 
deviation 4 hours. Estimate the percentage of students who: 

(0 spend more than 36 hours on the project; 

(ii) finish the project in less than 25 hours; 

(Hi) finish the project in less than 20 hours. 

3 A factory produces ball-bearings whose masses are normally distributed 
with mean 45.32 g and standard deviation 0.17 g. What is the probability 
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that a ball-bearing chosen at random will have a mass: (z) more than 
45.40 g; (ii) more than 45.50 g; (Hi) less than 45.00 g; (iv) between 45.20 
and 45.45 g? 

4 In a certain book the frequency function for the number of words per 

page may be taken as approximately Normal with mean 800 and standard 
deviation 50. If I choose three pages at random, what is the probability 
that none of them has between 830 and 845 words? (s.m.p.) 

5 The daily delivery of mail at a biological research station follows a time 
pattern conforming to the normal distribution, with a mean time of arrival 
at 8.40 a.m. and with a standard deviation of 20 minutes. 

Estimate the number of occasions during the 250 working days in the 
year when the mail arrives 

(i) before the main gates open, at 8.00 a.m.; 

(ii) after the arrival of the office staff, at 8.20 a.m.; 

(iii) during the Director’s daily meeting with heads of research sections 

(9.00 a.m.-9.20 a.m.). (a.e.b.) 

6 The heights in centimetres of a sample of 700 six-month old babies, 
attending a post-natal clinic, are given in the following frequency table: 


Height/cm 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

Total 

Frequency 

25 

35 

52 

84 

120 

135 

101 

61 

40 

33 

14 

700 


Calculate the mean and standard deviation of the heights. 

Assuming that the heights of all such babies are normally distributed 
about this mean with this standard deviation, estimate: 

(z) the percentage of all six-month old babies of height 70 cm or more; 
(ii) the height that will be exceeded by 60% of all babies of this age. 

7 The table shows the frequency f with which x a-particles were radiated 
from a source in a given time, the values of x being integers only. 


X 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

f 

30 

37 

71 

126 

130 

108 

67 

5 

3 

2 

1 


Calculate the mean value of x and its standard deviation. Assuming that 
the sample fits a normal distribution, find the value of x that will be exceeded 
on 70% of occasions. (m.e.i.) 

8 If a: is Normally distributed with mean /x and standard deviation <T, 
calculate the value of X such that: 

(z) p(\x — /x| < Xa) = 0.5; 

(ii) p(\x — n\ < Xct) = 0.9. 

9 Packets of soap-powder are filled in such a way that the masses of their 
contents are normally distributed with mean 520 g and standard deviation 
15 g. The cartons are nominally of 500 g. 
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(i) Calculate (correct to 1 d.p.) the percentage of packets that will be 
‘ under-weight ’. 

(ii) Calculate the percentage of packets containing more than 5% above 
their nominal contents. 

(iii) If the mean mass of soap-powder can be altered, without affecting 
the standard deviation, what should the new mean be to ensure that only 
2 % of packets are under-weight? 

10 Hens’ eggs have mean mass 60 g with standard deviation 15 g, and 

the distribution may be taken as Normal. Eggs of mass less than 45 g are 
classified as ‘small’. The remainder are divided into ‘standard’ and 
‘large’, and it is desired that these should occur with equal frequency. 
Suggest the mass at which the division should be made (correct to the 
nearest gram). (s.m.p.) 

11 In a cross-country race the times taken by competitors to complete the 
course are approximately normally distributed with mean 47 minutes and 
standard deviation 9 minutes. The race organisers wish to split the com¬ 
petitors, on the basis of their times, into 5 equal categories, very slow, slow, 
average, fast and very fast, so that there will be an equal number of com¬ 
petitors (as far as possible) in each category. 

(i) What is the least time in which a competitor must complete the race 
to be recorded as ‘ very fast ’ ? 

(ii) Between what times will a competitor be recorded as ‘average’? 

(iii) Between what times will a competitor be regarded as slow? (Give 
answers in minutes, correct to 1 d.p.) 

12 A machine produces components to any required length specification 
with a standard deviation of 1.40 mm. At a certain setting it produces to a 
mean length of 102.30 mm. Assuming the distribution of lengths to be 
Normal, calculate: 

(i) what percentage would be rejected as less than 100 mm long; 

(ii) to what value, to the nearest 0.01 mm, the mean should be adjusted 
if this rejection rate is to be 1 %; 

(iii) whether at the new setting more than 1% of components would 

exceed 107 mm in length. (m.e.i.) 

13 Machined components are accepted if they pass through a gauge of 

1.040 cm and do not pass through a gauge of 0.960 cm. It was found that 
over a period of production the percentages of components rejected by the 
larger and smaller gauges were 3.5 and 1.5 respectively. Assuming that 
the distribution of the dimension tested is normal, find the mean and the 
standard deviation. (o.c.) 

14 A manufacturer hopes, for the sake of his reputation, to produce 
an article of such quality that not more than 5% of the articles may be 
expected to last for less than 6 months; and for reasonable economy in 
manufacturing costs, that not more than 15% may be expected to last for 
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more than 20 months. He can control quality to give (assuming normal 
distribution) a pre-determined average fi and standard deviation o for the 
durability of his product. At what values of fi and a should he aim? (c.) 

15 Skulls may be classified into 3 types according to an index based on 
their length-breadth ratio: 

type A, under 75; 

type B, between 75 and 80; 

type C, over 80. 

A large number of skulls are examined and it is found that 58% are of 
type A, 38% of type B and 4% of type C. Assuming that the length-breadth 
ratio is normally distributed, determine the mean and standard deviation 
of the length-breadth ratio of the skulls. (c.) 

16 A man leaves home at 8.00 a.m. every morning in order to arrive at 
work at 9.00 a.m. He finds that over a long period he is late once in forty 
times. He then tries leaving home at 7.55 a.m. and finds that over a similar 
period he is late once in one hundred times. Assuming that the time of his 
journey has a normal distribution, before what time should he leave home 
in order not to be late more than once in two hundred times? (s.m.p.) 

17 In a particular school 1 460 pupils were present on a particular day. 
By 8.40 a.m. 80 pupils had already arrived, and at 9.00 a.m. 12 pupils 
had not arrived but were on their way to school. By assuming that the fre¬ 
quency function of arrival times approximates to Normal form, use tables 
to estimate: 

(i) the time by which half of those eventually present had arrived; 

( ii) the standard deviation of the times of arrival. 

If registration occurred at 8.55 a.m. how many would not have arrived 
by then? 

If each school entrance permitted a maximum of 30 pupils per minute to 
enter, find the minimum number of entrances required to cope with the 
‘peak’ minute of arrival. (s.m.p.) 

18 The length of a certain mass-produced item is a normal variable with 
mean 24.3 mm and standard deviation 0.4 mm. Items are rejected if 
their lengths are below 23.7 mm or above 25.2 mm. Calculate the propor¬ 
tion of items that are rejected, and the proportion of rejects that are too 
long. The machine making these items is adjusted with the intention of 
reducing the proportion rejected; the standard deviation of length cannot 
be altered, but the mean can. State what value of the mean leads to the 
minimum proportion of rejects, and give the value of this proportion, (c.) 

The limiting binomial distribution 

The Normal probability distribution has so far arisen as a suitable model to 
describe certain statistical data, drawn from a wide variety of sources. We 




In the first case, with/? = the distinctive normal bell-shape is immedi¬ 
ately apparent, even for relatively small values of n. And for n = 20 the 
outline of the probability diagram approximates very closely to the Normal 
curve. With a symmetrical binomial distribution perhaps this is not so 
surprising, but with an apparently asymmetrical distribution, such as we 
have with p = the result is much more striking. Indeed, for small values 
of n (e.g., n — 4), the probability diagram bears no resemblance at all to 
the symmetrical Normal shape; but when n = 10 we see that the diagram 
is much less skew, and when n = 20 we once again have a diagram whose 
outline is very similar to the characteristic Normal shape. It is not difficult 
to envisage that as n becomes larger and larger, and the horizontal and 
vertical scales are suitably adjusted (maintaining a total area of 1), the dia¬ 
gram tends closer and closer to the Normal curve. The two diagrams with 
n — 20 are reproduced below, with Normal curves superimposed. 
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We know, from section 12.1, that for a binomial distribution 
fi = np and o = yjnpq , 

which therefore gives us the values of /1 and a indicated on the diagram. 
Derivation of the Normal probability function 

We can now derive the equation, stated earlier in this section, of the 
Normal probability function, by regarding it as the limit of the binomial 
distribution. 

The mathematics involved in this derivation is not particularly easy, 
and it should not be regarded as essential to follow the working through in 
detail. 

We have seen that as n —> 00 the profile of the binomial distribution 
tends: 

(i) to move to the right (since pi = np); 

(ii) to spread (since a = yj(npq)); 
and (Hi) to become flatter. 

Now the first of these can be controlled (or standardised) by letting 
u — r — np, 

and the second by letting 
u r — np 

t = 

V (npg) V (npg) 

But this second transformation is equivalent to diminishing all widths by a 
factor 1 l-sjnpq] so if the area beneath the profile is to remain 1, each of the 
ordinates must be stretched by a corresponding factor f (npq) which (hope¬ 
fully) will also counteract the flattening. 
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By this means we can bring each distribution into a standard position 
with jj, = 0 and (7=1, and so can more easily investigate the alterations 
of its shape (rather than of its position and spread) as n —> go. 

If we now consider the two neighbouring ordinates p r and p r+1 of the 
original distribution, we see that each is stretched by a factor yj (npq), whilst 
the distance between them is reduced from 1 to l/^/(npq). Furthermore, we 
suppose that as these ordinates draw closer together, the distribution tends 
to one with probability density function p(t ): 



If the ordinates p r and p r+x are transformed into (f>(t) and (j)(t + St), it 
follows that 

4>{t) = y/(npq) p r and St = ,/ • 

V (npq) 

(j){t + St) = a / (npq) Pr+\ ■ 

Hence 5(f) = (j)(t + St) — (p(t) 

~ \J(npq) (Pr + 1 - Pr) 

d <t> ^ Pr+1 ~ Pr _ Pr+1 
=> —— ^ - -*— — 1 . 

<P Pr Pr 

But P r = i^jp r q n ~ r and p r +1 = ^ + 1 ^/ ,r+ Y“ r_ ’■ 









12-3 the normal distribution 


309 


So 


<t> 


n 

r + 1 


pr+l g n-r-l 


- 1 


W" 


- 1 


(” - r)p 

(r + 1 )q 

np — rp — rq — q np — r — q 


(r + 1 )? 


(r + 1 )q 


But 


So 


r = np + y]{npq)t and dt 


1 

y/(npq) 


w [- - q\s[ij}Pq) 

4> St [np + sj{npq)t + 1 ~\q 


K -np* ~ V ( n P9) 

np + y/ (npq)t + 1 

« — ~ V (?/”/>) 

1 + V (?/”/*)* + 1 ! n P 


Letting a — > go (and so dt —► 0), 
1 d0 

T -~ = -f 
(j) d£ 

=> In <f)(t) = —\t 2 -f- In c 


4>(t) = c e i(2 . 


Hence 


<p(t) d t = c 


-* 2 At. 


But 4>(t) dt = 1 , since (pit) is a p.d.f., and we have also seen (Miscel¬ 
laneous problems, chapter 9 ) that e~* 2 d.v = yjn, which implies that 
J-*, e _i,2 d< = yj{2n). 


Hence c = 


V(2 *) 

and the limiting curve is 



Finally, we can reverse the process of standardisation by the transforma¬ 
tion 

x - n 


G 
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and thereby obtain the probability density function of the more general 
Normal distribution as 


4>(x) = 


_ e ~( x ~ A*) 2 /2 < t 2 

s/{2n)cr 


We are now able to put the limiting property of the Normal distribution 
to practical use, as illustrated in the following example. 


Example 4 

Forty per cent of university mathematics students are female. Assuming 
that these young ladies are scattered randomly throughout the country, 
what is the probability that a mathematics department of 100 students 
contains at least 50 women? 

The probability model that describes this situation is a binomial one, 
with n = 100,/? = 0.4, q = 0.6. 

The required probability is therefore given by: 


p (at least 50 women) = p 50 + p 51 + * • * + Pi 0 o 
= f 1 ™) (0.4) 5 O (0.6) 5 ° 

+ (^i^) (0-4) 5 ‘(0.6 ) 49 + ••• + 



(0.4) 10 °. 


The evaluation of such a probability would be, to say the least, tedious. 
But to ease the calculation, we can now use the Normal approximation to 
this binomial distribution, with 


fi — np = 100 x 0.4 = 40 
and a = yjnpq — yj( 100 x 0.4 x 0.6) = 4.9. 

Now if we denote the number of female students by f then we require 
p{f> 50) = 1 49). 

Using the Normal distribution therefore it would seem reasonable to 
say that 


but this is not quite true and the discrepancy is illustrated in the diagram 
opposite. 

The shaded areas represent, on the left p(f ^ 49) and on the right 
<D[(49 — 40)/(4.9)]. The difference between them is fairly evident, and 
arises from the fact that we are using a continuous curve to approximate 
to a discrete probability distribution. What we need to do is to shift the 
boundary of the shaded area on our right-hand diagram half a unit to the 
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right. We are then considering $[(49.5 — 40)/(4.9)], which gives a 
better approximation to p(J ^ 49). Such a modification is known as a 
continuity correction . (For very large values of n the correction is insignificant, 
and so may be disregarded.) 




Hence p{f 49) = 4> (^^ 

= 0(1.94) 

- 0.974 

=> p(J Ss 50) = 1 - 0.974 = 0.026. 

So the probability of a department of 100 containing at least 50 women is 
0.026. 


For a general binomial distribution, where n is large, and the probability 
of‘success’ = p. 

(N + - - 

p (at most N successes) = $ [- - — 

V VW 



Exercise 12.3b 

1 If an unbiased coin is tossed 100 times, what is the probability that 
(i) there will be more than 60 heads/ 

(«) there will be at least 45 and at most 55 heads? 

(Hi) there will be fewer than 43 heads? 

2 If an unbiased coin is tossed 1 000 times, what is the probability that 
(i) there will be more than 600 heads? 

(«) there will be at least 450 and at most 550 heads? 

{Hi) there will be fewer than 520 heads? 

3 If a ‘fair’ die is thrown 300 times, what is the probability that 

(i) there will be more than 60 sixes? 

(ii) there will be fewer than 45 sixes? 
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4 In an examination which consists of 100 questions, a student has a 
probability of 0.6 of getting each question correct. The student fails the 
examination if he obtains a mark less than 55, and obtains a distinction for 
a mark, of 68 or more. Calculate: 

(i) the probability that he fails the examination; 

(ii) the probability that he obtains a distinction. 

5 If it is known that 30% of people are short-sighted, what is the proba¬ 
bility that more than 33% of a sample, chosen at random, will be short¬ 
sighted if: 

(i) the sample consists of 100 people; 

(ii) the sample consists of 1 000 people; 

(Hi) the sample consists of 10 000 people. 

6 It is known that 72% of TV viewers watch a particularly popular pro¬ 
gramme. What is the probability that in a sample of 500 viewers, chosen at 
random, 

(i) more than 350 watch the programme? 

(ii) more than 375 watch the programme? 

(Hi) fewer than 340 watch the programme? 

7 A playing card is drawn at random from a full pack, containing no 
jokers, its suit is recorded and it is then returned to the pack; the pack is 
then shuffled and the procedure repeated. In 60 such draws calculate, using 
a Normal approximation, the probability that: 

(i) no more than 12 hearts appear; 

(ii) more than 20 spades are obtained. 

8 A confectionary firm produces three types of toffees, liquorice, nut and 
plain, and mixes them together in the ratio 1:2:5 before packing them 
into boxes. If there are 80 toffees to a box, what percentage of boxes will 
contain: 

(0 more than 25 nut toffees? 

(ii) less than 58 plain toffees? 

(Hi) more nut and liquorice than plain toffees? 

9 In a certain manufacturing process a 10% rate of defectives is regarded 
as just tolerable. It is decided to accept a day’s batch if in a random sample 
of size n taken from it the proportion of defectives does not exceed 12 %. 
Calculate the value of n if there is a 0.05 probability of rejecting a batch 
which is in fact producing 10 % of defectives. 

The percentage of defectives rises to 14%. The procedure described 
above is used with a sample of size 600; calculate, to 2 decimal places, the 
probability that as a result of the test the batch is accepted. (m.e.i.) 


The central limit theorem 

A computer may be used to produce the binary digits 0 and 1 in a random 
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fashion. Then if x denotes the value of the digit so obtained, 

p(x = 0) — p(x = 1) = 



■x 


For this distribution, 

ju x = 2 x 0 + 2 x 1 = 0.5 

and a 2 = |(1 - |) 2 + £(0 - |) 2 = 0.25 

=> o x — 0.5. 


Now suppose that instead of looking at the digits individually, we group 
them together in sets of, say, 10. And for each group we will consider a 
new variable, y , the sum of the digits. 

y can therefore take values from 0 to 10, and since each digit can have 
only one of two values, y has a binomial probability distribution with 
n — 10,/> = £ = 2* 


Hence = np = 5 
and 


<7 y = sjnpq - yj2.b. 



y represents the sum of 10 digits. We now define a third variable, z, to be 
the average of each group of digits, so that z = yq y • 

Since z is a constant fraction of its probability distribution will have 
the same shape as that of y, shown in the diagram, but the horizontal scale 
will be reduced by a factor 10. 

So 

and 

Hence 


Hz = To Py = 0.5 

i i /ft c 0.5 
<? z = ro<7 y = lV 2 * 5 = 

, <r x 

= Mx and <T Z = 
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Therefore if we consider the averages of groups of 10 digits, the expected 
value is the same as that of the digits taken individually, but the standard 
deviation of the averages (or ‘sample means’) is a fraction l/^/lO (i.e., 
about j) of the standard deviation of the individual digits. 

Moreover, despite the fact that the original distribution was rectangular, 
the distribution of sample means appears to be approaching the Normal 
distribution. This represents a special case of one of the most important 
and remarkable theorems in statistics, known as the central limit theorem , 
that if samples of size n are drawn at random from any background popula¬ 
tion then, as n —» 00, the distribution of the sample means tends to the 
Normal distribution. (Although we generally refer to the sample means in 
the central limit theorem, the theorem clearly also holds for the sample 
sums.) 

Having already discovered the limiting property of the binomial dis¬ 
tribution it is perhaps hardly surprising that the theorem should hold for 
our example of the binary digits. But suppose we consider the sample 
means of sets of 10 random digits in the range 0-9; the frequency diagram 
below illustrates the sample means, suitably grouped, of 50 samples taken 
from a table of random digits. 



The sample means in this case do not have a distribution in any way bino¬ 
mial; nevertheless, as predicted by the central limit theorem, the Normal 
shape of the distribution is again apparent. 

So, in yet another way, is emphasised the importance of the Normal 
distribution. 


Exercise 12.3c 

1 Using a table of random digits (0-9), obtain the sample means of 100 
sets of ten digits. Display these sample means on a frequency diagram 
similar to that shown in the text. 

2 What would have been the values of the expected value and standard 
deviation of the sample mean of the binary digits, described in the text, if 
the sample had been: (i) of size 25; (it) of size 100? 
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3 A computer is now programmed to produce randomly the digits 0, 2. 
Calculate the values of /t and a for this simple rectangular distribution, 
and hence obtain the expected value and standard deviation of the mean 
of a sample of size 10 . 

4 The digits 1, 2, 3, 4 are produced randomly. By applying the results 
obtained for binary digits, calculate the expected value and standard 
deviation of the mean of a sample of 100 such numbers. Hence, using the 
central limit theorem, estimate the probability that the mean of such a 
sample exceeds 2.7. 


12.4 The Poisson distribution 

In 12.3 we discovered that, for large values of n , the Normal distribution 
gave us a good approximation to the binomial distribution, and is in fact 
the limit of the binomial distribution as n —> 00. 

Consider, however, the binomial probabilities with n = 1 000 and 
p — 0.001 and their Normal approximations. 



Here, despite the fact that n is very large, the Normal distribution seems 
to give us a poor approximation. This apparent contradiction of our previ¬ 
ous result arises from the fact that in section 12.3 we assumed that as n 
became very large, so did the mean, ju(= np), with the effect that the 
binomial distribution became approximately symmetrical about this mean. 
For the distribution illustrated above, however, this is not so; for, even 
though n is large, 

fi = np = 1 000 x 0.001 = 1 , 

a relatively small value, and the distribution cannot be symmetrical 
about ill. 

In fact the limiting Normal property of the binomial distribution holds 
only if 
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H = np oo as ?z —>oo 


(an assumption which was made in the last section). In many cases, of 
course, this is so, but if np is constant as n —► oo, then the limit of the 
binomial is quite different, as we shall now show. 


We know that for the general binomial distribution, 


Pr= I m\~P) n 

, T ) 


n(n — 1). . . (ft — r + 1) r 


/(l - PY~ r 


ii- 


i - 


r - 1 


-p r (i -pr r 


-U- 


r - 1 


ft J r\ 
r - 1 \fi r 


■)—(1 - p) n - r 


ft j r\ 

Now since r is finite, as n —> oo 


i -!]•••( i - 

ft 




r - 1 


1 


and 1 - - = 1 - (ft - r) - + 

ft 

(a finite series) 


li (ft — r)(« — r — 1) /z 2 


2 ! 


ft — r (ft — r)(ft — r — 1) u 2 
= 1 -M H- 2 -i77 

ft ft 2 ! 


, /^ 2 /^ 3 
~ ^ + 27 “ 3 ! + 


(an infinite series) 


Therefore, in this limiting case, 



and the distribution so defined is known as the Poisson probability distribution. 

The distribution can also be obtained by considering its probability 
generator: 

The probability generator of the general binomial distribution is given by 
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G (t) = (q+ pt) n 

= (1 + p(t — l)) n since p + q = 1 

A*(* - 1)V 


= i + 


since fi — np. 


Now we know (Miscellaneous problems, chapter 9) that if* is finite, 

1 *Y X 

1 H— I —► e as n —> 00. 
n) 

So G(t) — > 1} as n —► 00. 

Now = e* x e-* 

.(, + „ + ^ + ... + -2 + . 

= e~* + pe-»t + ... + fL e -»f + . 


l/ 

so the coefficient of f gives p r = -y e 


Returning to the original binomial distribution, if we now use the 
Poisson distribution with fi = 1, we obtain probabilities which are very 
good approximations to the binomial, and these equivalent binomial and 
Poisson probabilities are set out in the table below. 


Probability 

Binomial 

Poisson 

Po 

0.367 7 

0.367 9 

Pi 

0.368 1 

0.367 9 

Pi 

0.184 0 

0.183 9 

P 3 

0.061 3 

0.061 3 

P 4 

0.015 3 

0.015 3 

P5 

0.003 0 

0.003 1 

Pe 

0.000 5 

0.000 5 


So we now have, under different circumstances, another limit to the 
binomial distribution, but with the important difference that this limiting 
distribution, like the binomial itself, but unlike the Normal distribution, 
is discrete. 

The probability diagrams for the Poisson distribution with fi = 0.5, 
2 and 5 respectively, are illustrated overleaf. 

It can be seen from these diagrams that, as n becomes larger, the Poisson 
distribution becomes more symmetrical in shape and, like the binomial, 
approximates to the Normal distribution. 
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Calculation of Poisson probabilities 

In the actual computation of probabilities, the Poisson distribution has 
one big advantage over the Normal and binomial distributions; it involves 
the use of only one parameter, //, as opposed to two in each of the other 
cases (fi, a for the Normal and n , p for the binomial). And though the 
formula for p r may seem at first glance rather daunting, in practice the 
calculation of probabilities can be carried out very conveniently: 

Suppose jU = 3, 

then p 0 = e~ 3 = 0.049 8 

p x = 3*~ 3 = 3p 0 = 0.149 4 

3 2 *“3 

p2 = —— = I Pi = 0-224 1 

3 3 e -3 

P 3 = —— = IP2 = 0.224 1 
3 4 e~ 3 

P 4 = = 4^3 = 0.168 1 

and p r+l =—*—p r (so in general p r+l =-~—p r ). 

r + 1 \ r + 1 / 

Mean and variance of Poisson distribution 
For the binomial distribution, 


ji = np 

and o 2 = npq = np( 1 — p). 

Now we can regard the Poisson distribution as the limit of the binomial 
distribution as n —> oo and np = fi (so that p —> 0 ). 

mean = // 

variance = lim np( 1 — p) = fi. 

n -* oo 


Hence 

and 
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Applications of the Poisson distribution 

We have derived the Poisson distribution as a limiting case of the binomial 
distribution. But it arises more fundamentally in its own right, whenever 
there is a continuous variable — say time or distance — and discrete events 
occur randomly and independently. In such a case, the number of events in a 
given interval has a Poisson distribution. (By imagining the continuous 
variable split into a very large number of equal small intervals, the reader 
can verify for himself that these conditions are consistent with those of a 
limiting binomial distribution.) Examples of such situations, in which the 
Poisson distribution is applicable, are provided by 

flaws occurring in lengths of steel piping, 
people arriving in a shop queue in a certain time interval, 
and radio-active particles being emitted from a source in some time interval. 
Consider the first of these examples: 


Example i 

It is found experimentally that there is an average of one flaw in every 
2 metres of a certain type of steel piping. If the piping is cut into 1-metre 
lengths what is the probability of 0, 1, 2 flaws in such a length? Find the 
equivalent probabilities if the pipe is cut into 4-metre lengths. 

( i ) For a 1-metre length of pipe, the expected number of flaws, [1 = 2 - 
Hence p Q = e~* = 0.606 5. 

Then using the relationship we have recently obtained between succes¬ 
sive Poisson probabilities, 

pi = \p 0 = 0.303 2 

and p 2 =^-= 0.075 8 . 

F2 2 

( ii) For 4-metre lengths of pipe, (1 — 2. 

Hence p 0 = e~ 2 = 0.135 3 

=> Pi — 2 /> 0 = 0.270 6 

=> p 2 =- 1 1 = 0.270 6 . 

It was quite clear that Example i should be tackled by means of the 
Poisson distribution. By comparison, in the following example we can use 
either a binomial or a Poisson probability model, and the equivalent 
results for the two models are compared. In example 3 we fit a Poisson 
model to experimental data. 
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Example 2 

A firm of wholesale fruit distributors has found that on average one apple 
in 50 is bruised on arrival from the growers. If the apples arrive in cartons 
of 100, calculate the probabilities of a carton having 0, 1,2, 3, more than 
3 bruised apples. 

Using a binomial model, n = 100 and p = 

Using a Poisson model, jx = 100 x = 2. 

The calculations for the two models are carried out simultaneously: 
binomial Poisson 


Po ~ 
Pi = 
P 2 = 
P3 = 


'49V 00 

50/ 

100\ /49V 9 
1 )\50j 50 

'100\ /49\ 9 8 / 1 \ 2 
^ 2 ) 1/30/ \50/ 

100\ /49 V 7 

3 / 1/50/ 1/50/ 


p(> 3) 


= 0.131 

6 

Po 

= 0.270 

6 

pi 

= 0.272 

7 

pi 

= 0.181 

8 

P3 

= 0.143 

3 

P( 


= e~ 2 = 0.135 3 
= 2p 0 = 0.270 6 
= p x = 0.270 6 
= %p 2 - 0.180 4 
>3) = 0.143 1 


The results clearly are very similar but, since its calculations are so much 
easier, we would usually adopt the Poisson model. 


Example 3 

The following classical data (without which any account of Poisson proba¬ 
bility would be incomplete!) gives the number of deaths by horse-kick of 
Prussian cavalrymen in ten corps during the years 1875-1894. 


No. of deaths per corps per year 

0 

1 

2 

3 

4 

Frequency of occurrence 

109 

65 

22 

3 

1 


We will fit a Poisson probability model to the data, and calculate the equi¬ 
valent theoretical frequencies. 

The mean number of deaths per year is given by 




m 
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109 x0 + 65 x1+ 22 x2 + 3x3+1x4 
200 

122 
200 

= 0.61. 

We will use this mean value as our parameter fi in the Poisson distribution. 
To obtain the theoretical (or expected) frequencies, we multiply the 
probabilities by the total frequency, 200 , and round off to the nearest whole 
number. 


No. of 
deaths 

Probability 

Expected 

frequency 

Observed 

frequency 

0 

e " 0 - 61 = 0.543 4 

109 

109 

1 

0.61 p 0 = 0.331 5 

66 

65 

2 

0.61 

— A = 0.101 1 

20 

22 

3 

0.61 

- p 2 = 0.020 6 

3 r2 

4 

3 

4 

0.61 

—— p 3 = 0.003 1 
4 

1 

1 


* Alternative derivation of Poisson distribution 


To complete our account of the Poisson distribution, we now derive it 
directly from our basic assumptions. 

Let us suppose that a series of events, such as the flashes of a Geiger 
counter, occur independently; and yet, over a long interval, at a uniform 
rate, so that in an interval of unit length the expected number of flashes is X. 
Then in a very short interval St, the probability 

of a single flash is XSt, 
of two or more flashes is 0 , 
and of no flashes is 1 — XSt. 


We now denote the probability of having r flashes in the first t units of 
time by p r (t). Then, if we consider a further small interval St , it is clear that 
p(r flashes in time t + St) 

= p(r flashes in time t and no flashes in interval St) 


+ p(r — 1 flashes in time t and 1 flash in interval St) 
p r (t + St) = p r (t) (1 — X St) + p r _ft)X St 


St 
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Letting St —> 0, we obtain 

Pr( 0 + ^r(0 = tyr- 1 (0 
d 




l/'AM] = A4-i(0 


U A >r(0]o = A [ 4-lW d/. 


But /> r (0) =0 if r 0, 

df. 


so e kt p x 


r(t) = A J\ 


( 1 ) 


Furthermore, 
p (0 flashes in time t + (5/) 

= p (0 flashes in time t and 0 flashes in interval St) 
=> /> 0 (< + St) = p 0 (t) (1 - A St) 

p 0 (t + St) - p 0 (t) 


St 

Letting St —► 0, 
p' 0 {t) + Xp 0 {t) = 0 
d 


+ Xp 0 {t) = 0. 


dt 


[>%«] = o 

«%(*) = A>(0). 


But we certainly start with 0 flashes, so/> 0 (0) = 1. 

Hence e kt p 0 (t) = 1. 

If we now put P r (t) — e kt p r (t ), we can rewrite equations (1) and (2) as 


( 2 ) 


P rW = ^ f p r-l 

Jo 


(t) dt 


and P 0 (f) = 1. 
So P l (t) = X 


dt = Xt 


P 2 (t) — X I Xt dt = 


X 2 t 2 


ft 

= X - 

jo 


n f X 2 t 2 J X 3 t 3 

P 3 (t) — X \ 2 dt — ^ 


fii(t) = Xte~ u -, 
} 2 f 2 

P 2 (t) = 

Pt 3 

ps(t) = -g-«-"; 


and, more generally, 


m = ™ 

r\ 


AY 


AW = —;- 

rl 


-xt 
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Finally, regarding t as fixed, we can write p r (t) as p r \ and since the 
expected number of flashes in this interval is At, we write ju = At and obtain 



Exercise 12.4 

1 It is known that 0.1% of all people react adversely to a certain type of 
drug. What is the probability that out of a sample of 1 000 people: 

(i) none will react to the drug; 

(ii) just one person will react; 

(Hi) more than two people will react? 

2 A hiker on a walking holiday in a country area estimates that in walking 
150 kilometres he has passed a total of 60 public houses. Assuming that 
these public houses are distributed fairly randomly about the countryside, 
what is the probability of his being able to quench his thirst: 

(1) within the next kilometre; 

(ii) within the next five kilometres? 

3 During a certain period of the day, an average of four people enter a 
supermarket each minute. Calculate the probability that : 

( 2 ) no-one enters during a particular minute; 

(ii) three people enter during a particular minute; 

(in) five people enter during a particular minute; 

(iv) no-one enters during a two-minute period. 

4 Over a period of time, the number of serious road accidents in a city 
averages 1.8 per day. What is the probability of there being: 

(i) no accidents on a given day; , 

(ii) more than two accidents on a given day; 

(in) fewer than five accidents in a three-day period? 

5 The average proportion of bad eggs in an egg packing station is one in 
2 000. The eggs are packed in boxes containing six eggs each. 

(i) Evaluate to two significant figures the probability that a box contains 
one bad egg. 

(ii) A housewife complains if she obtains two or more boxes, with one bad 
egg each, per 100 boxes. What is the probability that she complains? 

(m.e.i.) 

6 A manufacturer of electrical components knows that he can expect 
0-2% of his products to be faulty. What is the maximum number of com¬ 
ponents he can pack into a box, if he wishes to ensure that at least 90% of 
the boxes contain no defectives? 

7 During the Second World War 535 flying bombs fell on South London. 
The distribution of the numbers of hits in 576 areas, each of 0.25 km 2 is 
given in the table. Show that the aim was effectively random within the 
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South London area by calculating the expected frequencies for a Poisson 
distribution with the same average number of hits. 


No. of hits 

0 

1 

2 

3 

4 

5 

6 or more 

Frequency 

229 

211 

93 

35 

7 

1 

0 


(m.e.i.) 

8 The incidence of plumbing repairs in 78 council houses over a period 
of ten years is given in the table. Do the data support the view that there 
are good and bad tenants or the view that there are lucky and unlucky 
tenants? 


No. of repairs 

01 2 3 456789 10 

No. of houses 

3 13 16 16 10 9 3 5 1 1 1 


(Use a Poisson probability model and compare theoretical and observed 
frequencies.) (m.e.i.) 

9 At a stage in the mass production of lamp holders, random samples, 
each of 40 articles, are examined and the number of defective articles 
recorded. The numbers of defective articles in each of 200 samples are 
shown in the following frequency table: 


Defectives 

0 

1 

2 

3 

4 5 6 7 

No. of samples 

29 

56 

42 

42 

23 7 0 1 


Find the mean number of defectives per sample and the variance. Give 
reasons for thinking that the distribution approximates to a Poisson dis¬ 
tribution. 

Show that on this assumption there is a probability of about 5% of a 
sample containing more than four defectives and a probability of less than 
1% of a sample containing more than six defectives. (o.c.) 

10 Spot blemishes occur randomly along a steel wire. The number 
counted in consecutive centimetre lengths had the following distribution: 


No. of spots 

0 

1 2 3 4 5 6 7 

Frequency 

102 

150 112 56 21 6 2 1 


If the count had been over consecutive 2 cm lengths, estimate the fre¬ 
quency of occurrence of 3 spots per 2 cm. (You may assume that the fre¬ 
quencies are close to a Poisson distribution.) (s.m.p.) 

11 The following are the numbers of breakdowns of a machine in 40 
12-hour periods: 

0401100220110110132021221100200 
1 0 2 1 1 3 1 0 2. 
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Calculate the mean number of breakdowns per 12-hour period and, using 
this value for in a Poisson probability model, calculate the probabilities 
ofO, 1, 2, 3, 4 breakdowns. Hence estimate the number of breakdown-free 
days in a year (taken to consist of 300 12-hour working days). Also estimate 
the number of days in the year when there will be more than 2 breakdowns. 

12 Using probability generators, show that if x, y have independent 
Poisson distributions with means /i x , n y then x 4- y has a Poisson dis¬ 
tribution with mean j i x + fi y . 

13 Instruments A and B are set to record radiation from separate and 
independent sources. The number of particles recorded by the instruments 
in one second have Poisson distributions with means 2 and 4 respectively. 
Calculate the probabilities that: 

(i) at least one particle is recorded on one of the two instruments in a 
second ; 

(«) a total of 4 or more particles is recorded on the two instruments in a 
second. (o.c.) 

14 A factory produces high-quality china dishes. Flaws occur in the 
dishes themselves with a Poisson probability distribution, mean //, and 
flaws occur independently in the glazing, again with a Poisson probability 
distribution with mean (A. A dish is immediately rejected if it has a flaw of 
either kind. What is the probability that a dish is rejected? Hence find the 
probability that a rejected dish only had flaws in the glazing (expressing 
your answer in as simple a form as possible.) 

15 In a simplified probability model of the service in a barber's shop it is 
supposed that all haircuts take exactly six minutes and that a fresh batch of 
customers arrives at six-minute intervals. The number of customers in a 
batch is described by a Poisson probability function, the mean number 
being 3. Any customer who cannot be served instantly goes away and has 
his hair cut elsewhere. The shop is open for 40 hours a week. Calculate the 
theoretical frequencies with which batches of 0, 1, 2, 3, 4, 5 and more than 
5 customers will arrive per week. 

The proprietor reckons that it costs him £25 a week to staff and maintain 
each chair in his shop, and he charges 25 p for each haircut. Calculate his 
expected weekly profit if he has (i) 3, (ii) 4, (Hi) 5 chairs. (s.m.p.) 

16 Show that when an event occurs randomly in time (i.e., the number of 
events occurring in a fixed interval of time has a Poisson distribution — with 
mean X) then the time interval between successive occurrences has the 
exponential distribution: 

</>(x) = X e~ Xx (x > 0). 

The mean number of calls per minute made on a certain telephone 
exchange is two. Find: 

(i) the probability that in any given one minute interval not more than 
one call will be made; 
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(ii) the probability that in any given half-minute interval exactly two 
calls will be made; 

( Hi) the probability that there will be no call in the half-minute immedi¬ 
ately following a call. (m.e.i.) 


*12.5 Samples 

In practical statistics we rarely consider individual readings or measure¬ 
ments in isolation; the normal procedure is to use a sample , or set of readings, 
and from these readings to obtain a statistic , or representative value. The 
most common, and generally most useful, statistic is the mean . Intuitively 
we expect it to give us a good approximation to the expected value (or 
mean), ju, of the background population, and we also feel that somehow the 
mean is a more reliable estimate of fi than is a single reading. We will now 
give some substance to these intuitive notions by extending our ideas of 
expectation to cover the expected value and variance of the mean of a 
sample. But before examining the mean itself, we must first establish some 
basic results in expectation algebra. 


Expectation algebra 

Suppose that x is a random variable,! and takes values 
x 1 , x 2 , , • • • 

with probabilities p x , p 2 , p 3 , . . . . 

We now (as in chapter 7 ) use the notation E[x ] for the expected value , or 
expectation of x, so that 

E[x ] ='Z x <Pi = /*• 

Furthermore, we denote the variance of* by V\_x~\ so that 

F[x] = 


Hence 
and 

We now let a be constant, so that ax takes values ax x , ax 2i . . . with 
probabilities p 1 , p 2 , . . . . 

Hence E[ax~] = £ ax t pi 
= a £ x,pi 
= a E [a] . 

| It can be argued that what is generally referred to as a random variable is neither random nor 
variable. At this stage, however, it is undesirable to become too involved in such discussions, 
and since the term is in widespread use we shall continue to employ it. 


Z (*I - l 1 ) 2 pi = Z %2 iPi ~ V 2 

E[_(x - ju) 2 ] = £|> 2 ] - n 2 . 


E[x] = [i 

V[x] = EUx - /*) 2 ] = £[x 2 ] - a t 2 
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Similarly V[ax~\ — £ (ax t — aii) 2 p i 

= 5> 2 (* f - F) 2 />i 

= « 2 1 (*j - ^) 2 /'i 
= a 2 F|>]. 

Summarising, 

E\ax ] = aii[Y] 

V[ax] = a 2 F|>] 


The same results can be shown to hold for a continuous variable, using 
the probability density function, and integration instead of summation. 

Example 1 

If measurements are made in centimetres, the expected value and variance 
of length of leaf of a certain type of rhododendron are 5.5 cm and 2.4 cm 2 
respectively. What will the expected value and variance be if measurements 
are made in millimetres? 

Let x denote the leaf length in centimetres, 

then E[x] = 5.5 and V[x] = 2.4. 

The length in millimetres is given by lOx, so the expected length in milli¬ 
metres is given by 2? [10*] 

and £[10*] = 10£[*] 

= 55. 

Similarly F[l(k] = 10 2 T[x] 

= 240. 

These results of course are hardly surprising, and simply fortify the 
above expectation results with common sense. 


Two random variables 

Let x, y be two random variables, each of them discrete. (An equivalent 
analysis is also possible for continuous variables, but it involves the use of 
more advanced mathematics, with which the reader is probably not yet 
acquainted; it can be found in most specialised probability texts.) 

Let x take possible values x x , x 2 , . . ., x t , . . . 
and y take possible values y x , y 2 , • • •, . . . 

(a finite or infinite number in each case.) 

We define the joint probability distribution of x and y by p{x = x t ,y = yj), 
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which can be written p(x t , yj) or more simply/)^. 

Then p(x { ) —YuPij^ summing over all possible values of j, 

j 

and p(yj) = £ p ijy summing over all possible values of i. 

i 

This situation can now be summarised in the table: 


X \ 

y\ 

y 2 • 

• • »j ■■■ 


Xl 


p 12 

Pu 

P( x 1 ) 

x 2 

P21 

P22 

p 2 j 

pi x 2 ) 

x i 

Pn 

Pi 2 

Pij 

Pi x i ) 


P(yi) 

P(y 2 ) • 

■ ■ Piyj) ■ ■ ■ 

1 


Using this notation, let us now investigate the expected value and variance 
of x + y, the sum of the two random variables. 

Considering the values of x T y which arise from all the combinations of 
values of x and y, 

E[x + y] = £ I (*, + yj)p t j 

= X X x iPij + X X yjpij 

* j i j 

= X(*< (X Pa)) + X Oj (X Pij)) 

» J J i 

= X x iP(*i) + X yjPisj) 

* j 

= £|>] + Ely']. 

Also E[xy] = 22 Ptj*t!fj- 

i J 

But x, y are independent =► p tj = p{x t )p(yj) 

=*" E \. x y\ = 2lP( x i)P(yj) x iyj 

i J 

= 2 2P(yj)yj 

= £[*]£[*]. 
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Moreover, V[x + y] = E[(x + y) 2 ] — (E[x + y\) 2 

= E[x 2 + 2 xy + y 2 \ - (E[x] + E[y]) 2 

= {. E[x 2 ] - (. E[x ])2} + {E[y 2 ] - (E[y]) 2 } 

+ 2{£[*y] - E[x]E[y]} 

= K[*] + K[y] + 2C*t,[*,y], 

where Cofl[;e,y] = E\xy] — and is known as the covariance of 

x and y , which is a measure of their interdependence. 

In particular, we have seen that x , y are independent 

=> £[*y] =£[*]£[*]. 

=> C^lXy] = 0 

=> V[x + y] = V[x] + V[y\. 


Summarising: in general 
but if x, y are independent 


E[x + y] = E[x\ + £[y] 

V\_ x + y] = V[x] + F[y] + 2 Cw [*, y] 
V[x + y] = V[x] + V[y] 


Similarly, E[x — y] = £M — £[y] 

H> - y] = K[x] + K[y] - 2 C<w [*, y]; 

and note that for independent x , y, 

- y] = K[x] + K[y]; 

so that the difference of two independent variables has exactly the same 
variance as their sum. 


Example 2 

A normal ‘fair’ 6-faced die is thrown together with a ‘fair 5 tetrahedral 
(4-faced) die. What is the expected value and variance of the total score 
registered on the two dice? (For the tetrahedral die, the score registered is 
that of the face on which it lands.) 


Let the score on the 6-faced die be denoted by x; 

then £M = ^(l + 2 + 3 + 4 + 5 + 6) 

= 3.5, 
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and V[x] = i(l 2 + 2 2 + 3 2 + 4 2 + 5 2 + 6 2 ) - 3.5 2 
= ^ - 3.5 2 
- 15.17 - 12.25 
= 2.92. 

Now let the score on the 4-faced die be denoted by y 

then E[i/] = i(l +2 + 3 + 4) 

= 2.5, 

and T[y] = ^(l 2 + 2 2 + 3 2 + 4 2 ) — 2.5 2 
= ¥- 2.5 2 
= 7.5 - 6.25 
= 1.25. 

Now the total score is given by x y 

and E[x + ^] = E\_x~\ + E[y] 

= 3.5 + 2.5 

= 6 

and if we assume that the two scores are independent, then 

V[x + y~\ — V\_x ] + F[y] 

= 2.92 + 1.25 
= 4.17. 

So the expected value and variance of the total score are respectively 

6 and 4.17. 


Example 3 

A firm which produces garden tools manufactures the blades and shafts 
(including handle) of its spades separately. The blade lengths (up to the 
point of attachment to the shaft) are approximately Normal, with mean 
30 cm and standard deviation 1 cm, and the lengths of the shafts are also 
Normally distributed with mean 78 cm and standard deviation 5 cm. 
What percentage of spades, when assembled, have lengths exceeding 
110 cm? 

We will assume (without proof) that the sum of two Normally dis¬ 
tributed variables is itself Normal. We will denote the length of the blade 
by x 9 and the length of the shaft by y. Therefore x + y denotes the total 
length of the spade. 

Hence E[x + y~\ = E[x\ + E\jf\ = 30 -(- 78 = 108 
and Vlx + y] = T[>] + T|>] = 5 2 + l 2 = 26. 

So the mean total length = 108, and its standard deviation = yj 26 = 5.1. 
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Now since we have assumed that x + y is Normally distributed, 


p(x 4* y > 110) 


= 1 



= 1 - 0(0.392) 

= 1 — 0.65 (to 2 d.p.) 
= 0.35. 



Therefore 35% of the spades are longer than 110 cm. 


Exercise 12.5a 

1 Prove that, if a is a constant, 

(i) E\x + a] = E[x ] + a ; 

(ii) V[x + a] = V\x]. 

2 If x and y are random variables, and a , b are constants, obtain expres¬ 
sions, in terms of E[x], E\_y], V[x] and V[y] for E\_ax + by] and V[ax + by] : 
(z) when x, y are independent; 

(ii) when x, y are not independent. 

3 Extend no. 2 to three mutually independent random variables x, y , z 
and constants a, b, c to obtain expressions for: 

(i) E[ax + by + cz]\ 

(ii) V[ax + by + cz]. 

4 Find the expected total score obtained when three standard, ‘fair 5 dice 
are thrown together. What is the variance of this score? 

5 Two standard , 4 fair 5 dice, one blue, the other red, are thrown together, 
and a total score is obtained by doubling the number shown on the blue 
die and adding to it the number on the red die. What are the expected 
value and variance of this score? Compare your results with those obtained 
in question 4. 

6 Two standard ‘fair’ dice are thrown, and their scores recorded. What 
are the expected value, and standard deviation of the difference between 
the two scores? 

7 Alan and Bob play a game in which Alan uses a 6 -faced die and Bob a 
4-faced die (as described in the text in example 2 ). To make the game 
‘fair’, they decide to double all Alan’s scores and treble Bob’s. Would you 
expect this to lead to a fair game? If not, who has the advantage? Evaluate 
the variance of each (increased) score on the single throw of a die. 

8 Prove that if x, y are independent, E\xy] = E\_x] x £[z/]. Hence cal- 
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culate the expected product of the scores obtained on two standard un¬ 
biased dice. 

9 If x and y are independent random single digit numbers other than zero, 
calculate the variance of 

(0 x; (ii) x + y\ (iii) 2x ; (iv) 2x — y. (s.m.p.) 

10 A man travels from his London office to his home by a tube journey 
from station A to station B. His walking times to A and from B add up to 
5 minutes with negligible variation, the variable factors in the journey 
being as follows, measured in minutes: 



Mean 

Standard 


time 

deviation 

(i) waiting for a train 

8 

2.6 

(ii) train journey 

47 

1.8 


Assuming that these two factors are independent and normally distributed, 
find the mean time and standard deviation of his whole journey. 

Estimate the probability of the whole journey taking 

( i ) less than 52 minutes; 

(ii) more than 65 minutes; 

(iii) between 57 and 62 minutes. (m.e.i.) 

i i Three independent components are placed in an electrical circuit in 
such a way that, as soon as the first fails, the second (reserve) component is 
automatically switched in, and then when the second component itself 
fails, the third is automatically switched in. The circuit functions as long 
as one of the components works. If the lifetimes of the components are 
Normally distributed with mean 2 000 hours and standard deviation 300 
hours, what is the probability that: 

(i) the circuit will function for more than 6 500 hours; 

(ii) the circuit will break down within 5 000 hours? 

12 If x t , x 2 , . . ., x n are independently distributed variables with means 
, /t 2 ,. . ., and standard deviations a 1 , cr 2 , . . ., a n respectively, write 
down the mean and standard deviation of y = A 1 x 1 T X 2 x 2 -+-•••+ k n x n 
where , A 2 ,. . ., A„ are constants. 

In the manufacture of certain metal tubes there are three distinct pro¬ 
cesses. The mean times for each process and the standard deviations of 
these times are given in the following table: 



Mean 

Standard 


time/s 

deviation/s 

Process 1 

60 

6 

Process 2 

10 

1 

Process 3 

40 

5 
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During the course of its manufacture each tube is subjected to processes 
1 and 3 once each, and to process 2 five times. Assuming that the times 
spent at each stage are distributed independently, calculate the mean and 
standard deviation of the total time taken in the manufacture of a tube. 
Assuming Normal variation, estimate: 

(i) the percentage of tubes which would take less than 130 seconds to 
manufacture; 

(ii) the percentage of tubes which would take more than 160 seconds to 

manufacture. (m.e.i.) 

13 In a factory, cylindrical pins are made to fit into circular holes in blocks 

of metal. The diameter of a pin is a Normal variable with mean 9.80 mm 
and standard deviation 0.10 mm. The diameter of a hole is a Normal 
variable with mean 10.00 mm and standard deviation 0.12 mm. A pin 
and hole 4 fit 5 if the diameter of the hole exceeds that of the pin by not more 
than 0.30 mm. Find the probability of getting a ‘fit’. (Assume difference in 
diameters to be Normally distributed.) (c.) 

14 Two independent random variables x x and x 2 have zero mean and 
thesame variance, o 2 . Given that y x = a x x x + a 2 x 2 and # 2 = b x x x b 2 x 2 , 
where a x , a 2 , b x , b 2 are constants, show: 

(*) that F[>j] = ( a\ + a\)t t 2 ; 

(ii) that the condition for Cov \_y x , yf\ = 0 is a x b x + a 2 b 2 = 0. 

Obtain an expression for V[z] where z — A x y x + X 2 y 2 and A x , X 2 are 
constants. 


Mean of n independent variables 

We will consider a sample of n random variables x x , x 2 , . . ., x n , chosen 
independently from some background population. (Note that we are 
using the ‘names’ x x , x 2 , . . ., x n in a somewhat different sense from that 
used previously, when they stood for the different values a single variable 
could take; convenience outweighs any possible ambiguity). Then each 
variable individually will have the same expected value (X and variance a 2 
as the background population. Let m be the mean of the sample. 


Then 


E[m] — E 



= -£Q>i] 


n ! 

1 

= -npt 
n 


= H 
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and F[m] = £ V 


= i£n>j 


= —z no 
n 


n 


So 


E[m\ = fi and 


V[m~\ = 


n 


and the standard deviation of the mean (also known as the standard error of 
the mean) is given by o/yjn. 

Hence we are absolutely justified in using m as an estimate of jU; and 
since the variance of the mean is less than that of individual readings, it 
follows, as expected, that there is less variation in the values of m than in the 
individual values of x t . 


Small and large samples 

Although we have been able to specify how the expected value and variance 
of any sample mean may be obtained, for a small sample there is no 
immediate way of stating its probability distribution; each must be 
analysed individually. 

For large samples however, we can use the central limit theorem, which 
tells us that any mean of a large sample has an approximate Normal dis¬ 
tribution, irrespective of the population from which it is drawn. 


Samples from a Normal distribution 

Since the central limit theorem tells us that the mean of a large sample is 
Normally distributed, it seems reasonable to assume (as can be proved) 
that the mean of any sample (large or small) drawn from a population which 
is itself Normally distributed, will be exactly Normal with expected value ju 
and standard deviation o/^Jn. 

Example 4 

A firm produces jars of jam in such a way that the net mass of jam per jar is 
known to be Normally distributed with mean 340 g and standard deviation 
12 g. The jars are packed in cartons of 16. In what percentage of the car¬ 
tons will the average mass of jam per jar be less than 336 g? 

For a sample of 16 jars, the mean mass m will be Normally distributed 
with {i — 340 and o = 12/^/16 = 3. 
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/ 336 — 340\ 

Therefore p(m < 336) = <1>(---j 

= 0)( —1.33) 

= 1 - <D(1.33) 

= 1-0.91 (to 2 d.p.) 

= 0.09. 

So the average mass of jam per jar will be less than 336 g in 9% of cartons. 


Example 5 

A laboratory obtains a certain chemical solution from the manufacturers in 
100 -cm 3 containers. The manufacturers state that on average the mass of 
impurity in each container is 3.8 mg, and its standard deviation is 0.9 mg. 
For a certain experiment the laboratory wants to ensure that there is at 
most a 5% chance that the mass of impurity in each 100 cm 3 of solution 
it uses exceeds 4.0 mg. How many containers of solution need to be mixed 
together to ensure this? 

Suppose that we mix n containers of chemical. We will assume that n is 
large, so that the mean mass of impurity in the sample so obtained is 
approximately Normally distributed, with 

= 3.8 mg and a = 0.9 /yjn mg. 


We require p(m >4.0) < 0.05 
=> p(m ^ 4.0) > 0.95 


O 


/4.0 - 3.8\ 
V 0.9 TJn ) 
0-2 Jn 
0.9 


> 


> 


0(1.67) 

1.67 


■ N jn > 1.67 x 4.5 = 7.52 
n > 56.7. 


So at least 57 containers of the solution need to be mixed together. 


Exercise 12.5b 

1 Assuming that the heights of male college students are Normally dis¬ 
tributed with mean 173 cm and standard deviation 8 cm, what is the 
probability that the mean height of a sample of 40 students will 

(i) exceed 175 cm; 

(«) lie between 172 and 174 cm? 

2 The marks obtained in a mathematics examination are approximately 
Normally distributed with mean 54 and standard deviation 13. What is 
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the probability that the average mark for a group of 10 students 

(i) exceeds 60; 

(ii) is less than 50? 

3 A certain industrial process takes a worker an average of 24 minutes to 
complete, and the standard deviation of the completion time is 6 minutes. 
If the process is repeated 100 times, use the central limit theorem to find 
approximately the probability that the mean process time exceeds 25 
minutes. 

4 Each schoolboy in a class of 36 spins an unbiased coin 100 times. What 
is the probability that the average number of heads obtained per boy 

(i) is less than 49; 

(ii) is greater than 52? 

(A continuity correction is not necessary in this case.) 

5 Capsules of a certain drug have a nominal net mass of 10 g. They are 
packed in batches of 100 capsules. A machine fills the capsules with the 
drug in such a way that the masses are normally distributed with mean net 
mass 10.06 g and standard deviation 0.2 g. Find the probability that: 

( i ) a batch does not contain any ‘underweight’ capsules; 

(ii) the mean net mass of the capsules in a batch does not fall below the 
nominal net mass. 

6 A coal merchant sells his coal in bags marked ‘50 kg’. In fact he claims 
that the average mass is 50 kg, with a standard deviation of 1 kg. A suspici¬ 
ous trade inspector has 60 of the bags weighed, and finds that their mean 
mass is 49.6 kg. What is the probability that such a result, or a more extreme 
one, could have arisen if the coal merchant’s claim is true? Do you think 
that the inspector’s suspicions were justified? 

7 Electric light bulbs are produced with a mean life-time of 1 600 hours 
and a standard deviation of 250 hours. Calculate the minimum sample 
size necessary to ensure that there is at most a 1% chance of the average 
life-time of such a sample being less than 1 550 hours. 

(Assume that the sample will be large enough for the mean to have an 
approximately Normal distribution.) 

8 How large a sample would you take in order to estimate the mean of a 
population, so that the probability is 0.95 that the sample mean will not 
differ from the true mean of the population by more than 0.2 of the standard 
deviation of the population? 


Unbiased estimates 

If T is some statistical estimate calculated from data, and 9 a probability 
parameter, then if Z£[7"] = 6, T is said to be an unbiased estimate of 6. 

It should be noted, however, that an unbiased estimate is not necessarily 
a particularly good estimate — its variance, for instance, may be quite 
large. It is simply the best we have. 
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It therefore follows, since E\m ] = jU, that the sample mean m is an 
unbiased estimate of /t, the mean, or expected value, of the parent popula¬ 
tion. 

Since m is an unbiased estimate of ju, it would seem to follow that s 2 
should be an unbiased estimate of a 2 ; after all, both are referred to as 
variances, and as we saw in 12 . 1 , their forms are very similar. To pursue 
this further, we note that s 2 is itself a random variable, depending only on 
the sample from which it is calculated, and we can therefore consider its 
expected value. 


Now s 2 = ^£( Xi - m) 2 , 

and in this form it is a little difficult to manage, since we cannot say any¬ 
thing directly about E\_(x t — m) 2 ~\. However, we do know that 

£[(*, " /^) 2 ] = * 2 , 

Also, recalling that x t has mean m and variance s 2 , it follows that x t — fx 
has mean m — fi and variance s 2 . 

S° s 2 = 1 £ (Xf - n) 2 - (m - n) 2 
=> E{.s^ = £ (*i - M) 2 ] - El(m - n) 2 -] 


= a 2 - F[m] 



n 


So s 2 is not an unbiased estimate of cr 2 , and in fact j 2 will tend to give a 
low value for a 2 (as can readily be seen by considering the zero value of j 2 
that would arise if the sample had only one member). 


Therefore 



n — 1 


so that an ‘ unbiased ’ estimate of a is 





338 PROBABILITY DISTRIBUTIONS AND STATISTICS 

Exercise 12.5c 

1 The lengths, in millimetres, of eight full-grown fish of a certain species 
were found to be: 56,49,68, 58,63,60,55, 59. Obtain unbiased estimates for 
the mean and variance of the length of the species. 


2 A sample of 15 leaves was obtained from a certain plant, and the areas 
of these leaves, in square centimetres, are given below: 

4.60, 5.85, 5.13, 3.94, 5.61,4.08,4.71,4.29, 6.01,5.46, 5.80,6.33,4.43,4.90, 
5.09. 

Obtain unbiased estimates of the mean and variance of leaf area (correct 
to 2 decimal places). 

3 In a sample of bars of chocolate produced in a particular factory, the 
mean mass was found to be 101.3 g and the standard deviation 1.6 g. 
Obtain an unbiased estimate (correct to 2 decimal places) of the standard 
deviation of all bars of chocolate produced in the factory if the sample size 
was: 

(i) 5; (ii) 20; (iii) 100. 


4 t is an unbiased estimator for a parameter 0 , and t i , i — 1 , 2 , . . ., n are 
values of t obtained from n experiments. 

(i) Prove that 
1 n 

— V is an unbiased estimate of 0 . 
n i= 1 

(ii) Find the relationship between the numbers k h (i = 1,. . .n) if E k^ 
is an unbiased estimate of 0 . 


5 A random variable x has expectation (i and variance < 7 2 . Find the 
variance of 


1 " 

11*« 


i = 1 


and the expected value of s 2 = -- £ (x t — x) 

n — 1 


where x x , x 2 , . . ., x n are n independent observations from the distribution 
of x. 

A further set of m independent observations ta ^en 

from the same distribution and 


m 


ttt i 

1 / = 1 


yf 
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If u 2 — Xs 2 + (1 — X)t 2 , (0 < X < 1), show that s 2 , t 2 and u 2 have the 
same expectation and find X such that V\u 2 ~\ is a minimum. 

^Take F|> 2 ] = (m.e.i.) 


*12.6 Significance and confidence 

In a television advertisement, out of a sample of eight housewives inter¬ 
viewed leaving a supermarket, six were found to prefer the new washing 
powder, ‘Squelch’. Is this convincing evidence that more housewives use 
Squelch than all other powders? 

This question involves firstly practical and experimental considerations, 
and secondly mathematical considerations. The first thing we must know 
is how the eight housewives were chosen. Were they chosen randomly? 
(For the present we must be content with an intuitive notion of random¬ 
ness.) Or did the interviewer particularly look out for shopping bags con¬ 
taining packets of Squelch? Such questions must be asked before any 
meaningful mathematical deductions can be made. However since we are 
primarily concerned with the mathematics of such situations, we will 
assume that these practical questions have been settled, and for the pur¬ 
poses of this example we will assume that the eight housewives were chosen 
randomly. 

We shall not (and indeed we cannot) answer the question about prefer¬ 
ence for Squelch directly. Instead we invert the question and ask: 

‘Assuming that exactly 50% of housewives prefer Squelch to other 
powders, what is the probability that this result (six out of eight choices 
for Squelch), or a more extreme one, will occur by chance? 5 

(The phrase ‘ this result, or a more extreme one 5 though it seems to complicate 
the problem, does in fact provide us with a standard method for dealing 
with such problems and the necessity of its inclusion will become in¬ 
creasingly obvious.) 

If this probability is large, then there is no reason to doubt the assumption, 
but if the probability is small then there is considerable evidence against 
our assumption; we become sceptical about the assumption that house¬ 
wives are indifferent, and increasingly certain that they do indeed prefer 
Squelch to other powders. 

The phrase ‘it is not true that more housewives prefer Squelch to any 
other powder 5 could be interpreted mathematically in a number of ways. 
We will consider the most extreme case for which the statement holds, when 
exactly half prefer Squelch, so that the statement is only just true. (Apart 
from any other considerations of course, this will simplify the mathematics.) 
We therefore have a binomial situation, with ‘success 5 being regarded as 
finding a housewife who prefers Squelch. 

So n — 8, p = and the probability distribution is: 
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- LL 

~ 256 

= 0.145 w \. 


Therefore there is a probability of y that six out of a sample of eight house¬ 
wives would choose Squelch even when there is no overall preference for 
the powder. So we should regard the evidence of the advertisement as not 
particularly convincing. 

This analysis is one example of a very simple significance test. The proba¬ 
bility y is a measure of the significance of the experimental result on the 
assumption of no overall preference. More generally, significance testing 
requires the use of two basic concepts: null hypothesis and level of significance. 


Null hypothesis 

This is the basic hypothesis upon which we build a probability model. It 
should be a precise statement, which lends itself to mathematical formula¬ 
tion. In our example the null hypothesis was that exactly half of housewives 
prefer Squelch. The term ‘null’ arises primarily from contexts such as 
medical research, where new treatments or drugs are tested for effective¬ 
ness, the null hypothesis being that such a treatment has no effect; and 
generally speaking a null hypothesis takes the form of just such a negative 
statement. 


Levels of significance 

Using the null hypothesis, we evaluate the probability of obtaining such an 
extreme value as our sample statistic: the smaller this probability, the 
greater is the evidence against the null hypothesis. We normally use three 
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standard levels of significance: 5%, 1%, and 0.1%, corresponding to 
probabilities of 0.05, 0.01, and 0.001 respectively. If, for instance, we 
obtain a probability of 0.03, we say that this result is significant at the 5% 
level (but not at the 1% level), and we can therefore reject the null hypo¬ 
thesis at the 5% level. Which significance level is used generally depends 
on the nature of the experiment or research. 


Errors 

There are two distinct types of error which may arise from significance 
testing: 

1 A sample may not provide significant evidence against the null hypo¬ 
thesis, and so the hypothesis may be accepted when in fact it is incorrect. 

2 If we reject the null hypothesis at, say, the 5% level, then in 5 out of 100 
trials the hypothesis could be correct, and the extreme probability will have 
occurred purely by chance. The incidence of this type of error can of course 
be reduced by using a higher level of significance, for example the 1% 
level. 

But further detailed analysis of such errors is beyond the scope of this 
text, and for the present we must just accept that such unavoidable errors 
may arise. 

The following two examples are typical significance tests. 


Example i 

Before the introduction of a new drug, the success rate in treatment of 
patients for a certain disease was 43%. However, of 88 patients treated with 
the drug, 51 recovered. On the basis of this evidence, is the drug signifi¬ 
cantly effective in treating the disease? 

Null hypothesis : That the drug has no effect, i.e., the probability of a patient 
recovering is still 0.43. 


We therefore have a binomial situation, with n = 88, j b — 0.43. 
Since n is large we can approximate by a Normal distribution with 

fi = 88 x 0.43 = 37.8 
and a = % /(88 x 0.43 x 0.57) = 4.64. 

Therefore p {51 or more recoveries) — 1 — p(^ 50 recoveries) 


= 1 



(using a continuity correction) 
= 1 - #(2.74) 

= 1 - 0.997 
= 0.003. 
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As 0.003 < 0.01 ? this result is significant at the 1% level. So the proba¬ 
bility, on the basis of the null hypothesis, of obtaining the observed result 
is very remote, and we can say with considerable confidence that the drug 
is effective. 

Example 2 

The machine described in example i of section 12 . 5 , filling jars with jam 
such that the mass of jam is Normally distributed with ju = 340 g and 
a — 12 g, breaks down. After it is repaired, it is found that the average 
mass of jam in the first 100 jars it fills is 342 g. Has the performance of the 
machine significantly altered? 

Null hypothesis'. The machine is unchanged, i.e., the means of samples, 
size 100, are Normally distributed, with mean 340 and standard deviation 
12/VlOO =1.2. 

We need to find the probability of a result of 342, or one more extreme, 
occurring by chance. In this case, however, a more extreme result could 
reasonably be one less than 338, since we are not really concerned whether 
the mass of jam is increased or decreased, but simply whether it has changed, 
and the relevant probability is indicated by the shaded areas in the diagram. 



For fairly obvious reasons, therefore, this is referred to as a two-tailed (or 
two-sided) test, as opposed to the previous example which was one-tailed. 

T _/342 - 340VI 

Now jfr(338 ^ mass ^ 342) = 2 1 - <X>I-—-1 

= 2[1 - 0(1.67)] 

= 2[1 — 0.952] 

= 0.096. 


This is not significant at the 5% level, and so we can say that this evidence 
does not indicate any significant alteration in the machine’s performance. 
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Exercise 12.6a 

1 A coin is tossed eight times, and a head is obtained each time. Test, at 
the 1% level, the hypothesis that the coin is unbiased. (Use a two-tailed 
test.) 

2 An amateur magician claims that he can read people’s minds. To prove 
this he tells a person to think of one of the colours blue, yellow or red, and 
he then predicts what that colour is. In ten such mind-reading sessions he is 
correct seven times. Is this sufficient evidence to accept his claim? 

3 Test, at the 5% level, the hypothesis that the variable x is Normally 
distributed with mean 80 and standard deviation 10 , if we obtain: 

(i) a value of x equal to 100 ; 

{ii ) a value of x equal to 65; 

(iii) three values ofx, all less than 70; 

(iv) three values of x, whose average is equal to 70. 

4 Over a number of years an average of 40 out of 100 patients who under¬ 
went a difficult operation survived. Last year new medical techniques were 
introduced and 73 out of 150 patients survived the operation. Explain 
whether or not the maintaining of these techniques is statistically justified. 

(s.M.P.) 

5 In a certain constituency, in the last election, f of the electorate voted 
for the Nationalist Party. In a sample of 200 voters it is now found that 107 
support the Nationalists. Test the hypothesis that the support for the 
Nationalist Party has not decreased since the last election. 

6 A tyre company claims that the lifetimes of its tyres are Normally dis¬ 
tributed with mean 34 000 km and standard deviation 6 000 km: 

(i) A motorist finds that one of their tyres lasts only 26 000 km. Does this 
lead you to doubt the manufacturers’ claims? 

(«) A car-hire firm using the tyres finds that the average life-time of 100 
such tyres is 32 200 km. Does this evidence lead you to doubt the manu¬ 
facturers’ claims? 

7 The masses of a certain species of rabbit are known to be Normally dis¬ 
tributed with mean 1.68 kg and standard deviation 0.24 kg. Nine rabbits 
are fed on specially enriched foodstuffs, and their average mass after two 
months is found to be 1.85 kg. Test at 

(i) the 5% level, 

{ii) the 1 % level, the hypothesis that the foodstuff does not increase the 
. rabbits’ masses. 

8 The following experiment was carried out 100 times by each of 16 pairs 

of children in a class. ‘ Each pair has an ordinary pack of cards from which 
one child selects a card and replaces it; the other child does likewise, and 
they score a point if the suit is the same for both ’. The average number of 
points scored per pair was 27.2. Test at the 5% significance level, the 
hypothesis that all the cards were drawn at random. (s.m.p.) 
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9 A random sample of 200 persons from a large population is examined 
for eye colour. In this sample 50 persons are found to have blue eyes. Is this 
result consistent with the hypothesis that the proportion of blue-eyed per¬ 
sons in this population is 0.2? What would have been your conclusion if 
there had been 200 blue-eyed persons in a random sample of 800 persons? 

(m.e.i.) 

10 A grower sows 200 Carlton lettuce seeds under conditions such that 
the average germination rate is 75%. By using a suitable approximation, 
find 

(z) the probability that more than 170 seeds germinate, 

(«) the probability that less than 140 seeds germinate. 

The grower also sows 120 Alberni lettuce seeds under the same conditions 
as the Carlton seeds, and finds that 82 seeds germinate. Test whether the 
Alberni seeds have a germination rate less than 75%. (c.) 

11 The protein intakes for a sample of six unemployed men was observed 
to be 86 , 82, 66 , 66 , 94, 104 g day' 1 . A large number of employed men 
had an average intake of 97.6 g day -1 and the variance of their intake 
was 225. 

Do these data support the view that at the time of the study unemployed 
men were underfed by comparison with men in employment. (m.e.i.) 

12 You are engaged as an expert witness for the prosecution in a court 

case in which a gaming club is accused of running an unfair roulette wheel. 
The evidence is that out of 3 700 trial spins, zero (on which the club wins) 
turned up 140 times. There are 37 possible scores on a trial spin, labelled 
0 to 36, and these should have equal probability. Test whether there is 
evidence that the wheel is biased. (o.) 

13 According to a certain hypothesis the variable x is uniformly dis¬ 
tributed in (0, 1). Determine the probability that the smallest member of a 
sample of n observations of x has value not less than X where 0 ^ X ^ 1. 

Ten observations were taken ofx; the values of the smallest and largest 
were 0.2 and 0.7. Calculate the probability that ten observations: 

(i) are each not less than 0 . 2 ; 

(ii) are each not greater than 0.7; 

(Hi) lie within an interval of length 0.5. 

State whether your results would lead you to reject the hypothesis. (c.) 

14 In a sample of 1 000 voters, an opinion pollster finds that 520 intend 
to vote for Jones and 480 for Smith in the forthcoming election. What is the 
probability of a result such as this, or one even more favourable for Jones, if 
in fact Jones has only the support of 50% of the electorate? How many 
people in the sample would need to support Jones to make it significant 

(i) at the 5% level, 

(ii) at the 1 % level; 

that he has more support than Smith. 
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15 A children’s test which had been standardised some years ago to have 
a mean of 100 and a standard deviation of 15 was applied recently to a 
random group of children, and their mean score was recorded as 101.6, but 
no record was made of the number of children tested. What is the least 
number of children tested if the result shows that the population mean is 
unlikely still to be 100? (You should assume that the mean value will cer¬ 
tainly not have decreased. Work at the 1% significance level.) (s.m.p.) 


Confidence intervals 

The links in lengths of chain produced in a factory are known to have 
breaking tensions which are Normally distributed with standard deviation 
0.24 kN. Eight links are tested to destruction, and are found to have break¬ 
ing tensions of 3.61, 3.58, 3.66, 3.65, 3.89, 3.48, 3.50, 3.59 kN. What can 
we say, if anything, about the average breaking tension of all links pro¬ 
duced in the factory? 

Let us denote the average breaking tension (of the background popula¬ 
tion) by jU. 

Now the sample mean (which for this sample is 3.62) is distributed 
Normally, with the same [i as the background population, and a — 0.24/^/8 
= 0.085. 

We know that (m — \i)/g has a standard Normal distribution, and from 
the properties of the standard Normal curve, it is 95% certain that 
(m — jx)/o lies between — 1.96 and 1.96. 



- < 1.96J = 0.95 

=> p( — 1.96cr < m — \i < 1.96(7) = 0.95. 

Rewriting the orderings, 

p(m — 1.96cr < jii < m + 1.96(7) = 0.95. 

Therefore for the chain links, it is 95% certain that 
3.62 - 1.96 x 0.085 < fi < 3.62 + 1.96 x 0.085 


So ( — 1.96 < 
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=> 3.62 - 0.17 < n < 3.62 + 0.17 

=> 3.45 < fi < 3.79. 

The interval 3.45 to 3.79 is referred to as the 95% confidence interval for ^u, 
and 3.45 and 3.79 are the 95% confidence limits. 

Similarly the 99% confidence limits for ji are 


3.62 ± 2.58 x 0.085 
= 3.62 ± 0.22 
(i.e., 3.40 and 3.84). 

The 95% and 99% confidence intervals are the standard intervals used, 
but there is no reason why others should not be employed. Quite naturally, 
the more certain we need to be of the interval within which lies, the 
larger that interval becomes. So for any experimental results, we have to 
balance the usefulness of the result with the confidence we can put in it, in 
choosing a suitable interval. Also, of course, the larger the sample used, 
the narrower will be the confidence interval so obtained. 

The general procedure for establishing confidence limits for fi (assuming 
a Normal distribution, with known standard deviation) is as follows: 

(: i ) Obtain a representative value — this may be a single reading or 

(better) the mean of a sample — and determine its standard deviation. 

(ii ) Decide on a confidence coefficient: 0.95 is for example the confidence 
coefficient corresponding to a 95% confidence interval. Denote this co¬ 
efficient by 1 — 2a (so for a 0.95 coefficient, a = 0.025). 

{Hi) Define c a by 

=« 

=> <5(0 = 1 - a, 



therefore the unshaded area is 1 — 2a, and the shaded areas are each a. 

(iv) Then, in the case of the sample mean. 

<7 

< }i < m H- y~ c a 

yjn 

and the confidence limits are m + acjyjn. 

Similarly, for a single observation x, the confidence limits for ju are x + ac a . 
However the confidence intervals obtained from single observations are 
usually so wide as to be of little real use. 

This technique of obtaining confidence intervals is very useful. But it 
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does have limitations, the most obvious of which is that the standard 
deviation of the population has to be known; in practice, however, o is 
often not known, and has to be estimated from the sample. A more com¬ 
plicated technique has then to be employed (using what is known as the ‘ V 
distribution), and the effect of using an estimated value of a is to widen 
the equivalent confidence intervals for a known value of <7. However, for 
large samples the process described above can given good approximations 
to the confidence limits even when the standard deviation has to be esti¬ 
mated. (See exercise 12 . 6 b, no. 7 .) 


Exercise 12.6b 

1 It is known that the variable x is Normally distributed with standard 
deviation 8 . The average size of a sample of 10 values of x is found to be 63. 
Find (i) 90%, (ii) 95%, and (Hi) 99% confidence limits for the mean value 
of x. 

2 The melting points, in °C, of 10 samples of a certain metal were found 
to be 1 154, 1 151, 1 154, 1 150, 1 148, 1 152, 1 155, 1 153, 1 149, 1 154. 
Past experience indicates that these observations will be Normally dis¬ 
tributed, with standard deviation equal to 3 °C. Find (i) 95%, and (ii) 99% 
confidence limits for the mean melting point of the metal. 

3 It is known that the heights of 14-year old schoolboys are Normally 
distributed, with a standard deviation of 9.2 cm. The average height of 
16 such schoolboys, chosen at random in a certain school, is found to be 
159.1 cm. Find (i) 95%, and (ii) 99% confidence limits for the average 
height of 14-year old schoolboys. 

4 The average length of 200 nails produced by a certain machine was 
found to be 7.42 cm. If the standard deviation of nail lengths is known to 
be 0.53 cm, find (i) 95% and (ii) 99% confidence limits for the mean length 
of all nails produced by the machine. 

5 It is known that an examination paper is marked in such a way that the 
standard deviation of the marks is 15.1. In a certain school, 80 candidates 
take the examination, and they have an average mark of 57.4. Find 
(i) 95% and (ii) 99% confidence limits for the mean mark in the examina¬ 
tion. 

6 A factory manufacturing ammeters tests them for zero errors in their 
calibration. From past routine tests, it is known that the standard devia¬ 
tion of these errors is 0.3. 

A batch of 9 ammeters, taken from one worker’s production has zero 
errors of 1.0, —0.1, —0.3, 1. 6 , 0.5, 0.4, 0.5, 0.2, —0.2. Test whether there 
is evidence of bias in the ammeters produced by this worker, and establish a 
95% confidence interval for the mean zero error of his ammeters. (o.) 



34 ^ PROBABILITY DISTRIBUTION AND STATISTICS 

7 A random sample of 100 capacitors, each of nominal capacitance 
2 jiF was taken from a very large batch. The capacitance of each capacitor 
in the sample was measured. The results of these measurements are sum¬ 
marised in the following table: 


Capacitance/pF 
(mid-interval value) 

1.85 

1.90 1.95 2.00 2.05 2.10 2.15 

No. of capacitors 

2 

12 23 31 20 10 2 


Calculate the mean and the standard deviation of these capacitances. 
Calculate also the standard error of the mean, and use it to determine 95% 
confidence limits for the mean capacitance of capacitors in the batch. (Use 
a Normal approximation.) (m.e.i.) 

8 In a random sample of 1 000 housewives from a large population 300 
stated that they used a certain detergent. Show that 95% confidence 
limits for the proportion of the population using this detergent are (approxi¬ 
mately) 0.272 to 0.328. (Use a Normal approximation to the binomial 
distribution, and an estimated value of a.) 

Following an advertising campaign a second sample of 800 housewives 
was taken, and of these 260 stated that they used the detergent. Is this 
evidence of the success of the campaign? 

Subsequently it was decided to give a free gift with each packet of the 
detergent, and later in a random sample of 600 housewives 216 stated that 
they used the detergent. Does this indicate the success of the free gift 
scheme? (m.e.i.) 

9 The probability density function of a variable x takes the form shown 
in the diagram, the constant a being unknown. A single observation x y is 
made of x. Give an unbiased estimate of <z, and calculate 95% confidence 
limits for a. (Use first principles.) 


a 


x 



After a total of 24 observations have been made, the mean x is calculated. 
Use this mean to obtain an unbiased estimate of a, and calculate approxi¬ 
mate 95% confidence limits for a. (c.) 
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*12.7 Correlation and regression 

Bivariate data 

The marks of a class of 20 pupils in a mathematics examination, and a 
French examination are given in the table below. 


Mathematics 

10 

13 

17 

19 

21 

22 

22 

22 

23 

24 

French 

10 

16 

11 

16 

17 

17 

18 

20 

22 

18 

Mathematics 

24 

26 

27 

28 

31 

31 

31 

34 

35 

37 

French 

25 

20 

21 

18 

18 

24 

26 

29 

22 

25 


Since the marks are paired for each pupil, such data is referred to as 
bivariate data (as opposed to univariate data, where each item is treated 
individually; all the data so far dealt with in this chapter has been uni¬ 
variate, though we rarely use this term). It is generally convenient to 
represent bivariate data in the form of ordered pairs. Thus the data above 
would be (10, 10), (13, 16),.... 


Scatter diagrams 

To display bivariate data pictorially, we use a scatter diagram. This is 
obtained by regarding the ordered pairs of bivariate data as Cartesian 
coordinates, and marking the associated points on a graph: 



The points in the diagram all lie within a relatively narrow band, and this 
is emphasised by the boundary which has been superimposed around the 
points. This suggests some association between the x and y coordinates 
(i.e., between the mathematics and French marks). Such association is 
generally referred to as correlation. 
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The scatter diagram illustrates bi¬ 
variate data which is closely corre¬ 
lated (also referred to as positive or 
direct correlation), i.e., small values 
of x correspond to small values of y 
and large x to large y. An example of 
such data might be the number of 
pupils and number of staff in second¬ 
ary schools. 



The second diagram shows data for 
which there is little or no correlation, 
for example the number of pupils, 
and number of trees planted in the 
grounds of a group of schools. 



The third diagram illustrates negative 
(or inverse) correlation — the data 
here for example might be the num¬ 
bers of pupils in schools, and the 
average percentages of the school’s 
pupils per class. 



Regression 

A precise measure of correlation follows naturally from investigation of the 
related topic of regression. Whilst in the study of correlation both variates 
are regarded as having equal status, in regression one (say x, though either 
may be chosen) is regarded as the independent variate, and the dependence 
of the other variate, y , on x is investigated. To be precise, we need to find a 
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relationship between each fixed value of x , and the expected (or mean) 
value ofy for that particular x. Let us illustrate this by a theoretical example: 

Two dice, one red and the other blue, are thrown together; let x denote 
the score on the red die and y the total score on the two dice. Then the 
possible (equally likely) values of y for each value of x are shown in the 
diagram: 



Now when x = 1, the expected value of y is equal to 
£(2 + 3 + 4 + 5 + 6 + 7)= 4.5. 

This point is marked on the diagram by a cross, and equivalent points for 
the other values of x are similarly marked. The straight line upon which all 
these points lie is then referred to as the regression line of y on x, and its 
equation in this case is y = x + 3.5. 

Similarly in the diagram below we have the regression line of x on y, 
which we see has equation y = 2x. 



In this case y has been regarded as the independent variable, and for a 
particular total score, we are considering the expected value of the score 
on the red die. 
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This example, of course, is an idealised probability model. In general, 
the relationship between x and the equivalent expected value of y will not 
necessarily be linear. But whatever the relationship, linear or otherwise, we 
shall restrict our task (at this stage) to finding a straight line approximation. 
Such an approximation is generally referred to as the line of regression of 
y on x, though it must be emphasised that such a line, derived from a 
bivariate sample, will only provide an approximation to the true regression 
relationship in the background population from which the sample is 
drawn. 

There is of course no single criterion for determining the 4 best 5 line of 
regression for a particular set of data, but in practice the most useful one is 
known as the least squares criterion , and the line so obtained is called the 
least squares line of regression of y on x\ 



If d t is the distance of the point P t from the line / in the ^-direction, and / is 
chosen in such a way that ]T df is a minimum, then l is the least squares 
line of regression of y on x. 

If instead of the line of regression of y on x, we need that of x on y , the 
least squares criterion is still used, but the deviations from the regression 
line are in this case measured in the ^-direction: 



The obvious question arises: why should we choose this particular 
criterion? Why not the more obvious one of minimising the sum of per¬ 
pendicular distances from the line? Our original definition of the regres- 
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sion relationship throws some light on the choice of deviation parallel to 
the axes, but the main justification for the criterion is on more advanced 
mathematical grounds. On a practical level, however, the algebra follow¬ 
ing from the least squares criterion is simpler than that arising, for example, 
from considerations of perpendicular distance, and the very neat and con¬ 
venient results are in themselves some justification for the method. 


Derivation of the line of regression of y on x 

Let the points P t , P 2 , . . ., P f , . . ., P„ of the scatter diagram have co- 
ordinates (x t , y,), (* 2 , y 2 ), ■ • ■ (* f , Hi), ■■■ (*„, y„)- Now if we take the 
equation of the regression line to be y = a + bx , then 

deviation d t = y t — (a + bx t ) 

= Vi - a - bx j. 

We therefore need to find values of 0, b such that 

n 

S = (y. — a — bx ( ) 2 is a minimum. 

i— 1 

We minimise S by regarding it as a function of a , b and then equating the 
partial derivatives of S with respect to a , b (written dS/da , dS/db) to zero. 

(< dS/da denotes the derivative of S with respect to a , while regarding b as a 
constant. Similarly dS/db is the derivative of S with respect to b , keeping a 
constant . A thorough treatment of partial differentiation can be found in 
most books on advanced calculus, but a detailed knowledge of the subject 
is not essential for the purposes of this section.) 

N° w ^ = -2 Y, iVi ~ a — bx i) 
dS 

and — = -2Y J x i {y i - a - bx^. 


So if dS/da = 0 and dS/db = 0, 
we obtain the equations: 

Z (yi - a - bx t) = ° => Z^ ~ na ~ b Y. x i =° (!) 

Z x i(yi - a - bx j) = 0 => X x iVi - a Z x t ~ b Z x f = 0 (2) 

and in their most convenient form, these equations can be written: 

I y t = na + b Z*i (1) 

Z*«»i=«Z*i + *Z^- ( 2 ) 

Dividing equation (1) by n, we obtain 
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and writing the means of x and y as x and y , the equation becomes 
y = a + bx. 

This tells us that the regression line passes through the ‘mean point’, 
(#, y ), so that the equation of the regression line can be rewritten as 

y -y = b(x - x). 

Hence we only need find the value of b from equations (1) and (2). This is 
done by multiplying equation (1) by £ x t and equation (2) by n , and then 
subtracting (1) from (2). This gives: 

n Z x,y t - (Z *t)(Lyi) = bln'Exf - (Z^-) 2 ] 

and dividing through by n 2 , 



and we also write 


= s xy , which is called the covariance of x, y. 

(s xy is a simplified statistical version of the covariance of x, y which arose in 

12.5O 

So s xy = bs x => b = -y- 

This quantity, s xy /s x , known as the coefficient of regression of y on x, is there¬ 
fore the gradient of the line of regression. 

The equation of the line of regression can therefore be written in its most 
convenient form as: 



If, alternatively, y is taken as the independent variable, it is clear by 
symmetry that the line of regression of x on y is given by 

$ 

X - X = -f (y - y). 
s y 

Now let us return to our original bivariate data, where the two variates 
were mathematics (*) and French (y) marks. We will see how the French 
mark depends linearly on the mathematics mark by obtaining the line of 
regression of y on x. 


_ /Z fLyi 

n \ n n 
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* 


y 

x 2 

xy 


10 


10 

100 

100 


13 


16 

169 

208 


17 


11 

289 

187 


19 


16 

361 

304 


21 


17 

441 

357 


22 


17 

484 

374 


22 


18 

484 

396 


22 


20 

484 

440 


23 


22 

529 

506 


24 


18 

576 

432 


24 


25 

576 

600 


26 


20 

676 

520 


27 


21 

729 

567 


28 


18 

784 

504 


31 


18 

961 

558 


31 


24 

961 

744 


31 


26 

961 

806 


34 


29 

1 156 

986 


35 


22 

1 225 

770 


37 


25 

1 369 

925 

2>i = 

497 

I Vi = 

393 £ x f = 

13 315 T,x0i = 

10 284 

=*■ X = 

24.85 

y = 

y #? 

19.65 ^ ' = 

20 

66575 ^ = 

514.2. 

Hence 

II 

665.75 - 

(24.85) 2 ; 

= 514.20 - 24.85 x 19.65 


= 

665.75 - 

617.52 

= 514.20 - 488.30 



= 

48.23 


= 25.90 



__ 25.90 
~ 48.23 


0.537. 


Therefore the line of regression of y on * is 

y - 19.65 = 0.537(* - 24.85) 

= 0.537* - 13.35 

=> y = 0.54* + 6.30, to 2 decimal places. 

In a similar way, we can calculate the regression line of * on y to be 


y ~ 0.84* — 1.30. 

The scatter diagram for the data is reproduced below with the two 
regression lines superimposed. The intersection of the two lines is the point 
(*, y ), since of course we have shown that this point must lie on both lines. 
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Example i (Grouped data) 

In a sample of 150 married couples, the relationship between age of 
husband and wife was investigated. The results are illustrated in the table 
below, where the numbers in the table are frequencies (e.g., 5 couples 
with both husband and wife in the age-range 16-19). 


Age of 
wife ( Y) 

Age of husband (X) 

16-19 20-29 30-39 40-49 50-59 60-69 70-79 

16-19 

5 3 

20-29 

1 25 7 2 

30-39 

1 28 5 1 1 

40-49 

1 2 24 4 

50-59 

3 18 

60-69 

1 11 2 

70-79 

1 4 


Find, as conveniently as possible, the two lines of regression for the data. 

We first of all label each interval by its average value, so that for instance 
the interval 20-29 (which includes ages actually up to 30), is labelled as 25. 
We then choose 45 as a working zero for both X and Y and scale down the 
values of X, Y by 10 to obtain the transformed (or coded) variables x, y. 
Working in terms of x, y will simplify the necessary arithmetic. The table 
then takes the form shown opposite, and we add the columns to obtain the 
frequencies f x , and the rows to obtain f y . The columns and rows signifying 
xfx, x 2 f x , yf and y 2 f y can then be easily completed and their totals cal¬ 
culated. 
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X 

-2.7 

-2 

-1 

0 

1 

2 

3 

fy 

yfy 

y% 

y\ 











- 2.7 

5 

3 






8 

-21.6 

58.3 

-2 

1 

25 

7 

2 




35 

-70 

140 

-1 


1 

28 

5 

1 

1 


36 

-36 

36 

0 


1 

2 

24 

4 



31 

0 

0 

1 




3 

18 



21 

21 

21 

2 





1 

11 

2 

14 

28 

56 

3 






1 

4 

5 

15 

45 

fx 

6 

30 

37 

34 

24 

13 

6 

150 

-63.6 

356.3 

Xfx 

-16.2 

-60 

-37 

0 

24 

26 

18 

-45.2 



X 2 fx 

43.7 

120 

37 

0 

24 

52 

54 

330.7 




So Ifx = If, = 1/ = 150. 


Z*/x = -45.2 

- 

- Yxfx 

x £/ 

-0.301 

I x 2 f x = 330.7 


I* 2 /* 
Zf ~ 

2.205 

Z yfy = - 63.6 

=> 

y z/ 

-0.424 

Yy% = 356.3 

=> 

Ty% 

Tf 

2.375 


It remains to calculate £ xyf. This is done simply by drawing out the 
table again, but with the entries this time representing xyf rather than just 
the frequency f Then the numbers in each row are added to obtain a 
further column, and the total for this column gives £ xyf. 



- 2.7 - 2-1012 3 


-2.7 

36.4 16.2 

52.6 

-2 

5.4 100 14 0 

119.4 

-1 

2 28 0 -1 -2 

27 

0 

0 0 0 0 

0 

1 

0 18 

18 

2 

2 44 12 

58 

3 

6 36 

42 


317 
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Yxyf 

So I>/= 317 => 2.133. 

Hence 4 = 2 .205 - (-0.301) 2 = 2.114 
4 = 2.375 - (— 0.424) 2 = 2.195 

= 2.133 - (-0.301) x (-0.424) = 2.005. 

Now transforming back to the original variables X and 7, 

X = 10* + 45 = 41.99 
7 = \0y + 45 = 40.76 

4 - 1004 = 211.4 

4 = lOOi 2 = 219.5 
= \00s xy = 200.5. 

So the regression coefficient of Y on X = 
and the regression coefficient of X on Y 

Therefore the equation of the regression line of Y on X is 

Y - 40.76 = 0.948 (X - 41.99) 

=> Y — 0.9X + 0.9, to 1 decimal place, 

and the equation of the regression line of X on Y is 

X - 41.99 = 0.913 (7 - 40.76) 

=> y = \.\X — 5.2, to 1 decimal place. 


200.5 
211.4 
_ 200.5 
" 219.5 


= 0.948 


= 0.913. 


Exercise 12.7a 

1 (i) Obtain the heights (*) and masses (y) of 10 people of about the 
same age. Represent this data on a scatter diagram, and draw by eye 
through the points of your diagram what you consider to be the line of 
4 best fit 5 . Calculate the equations of the lines of regression of y on x and of 
x on y, and draw them on the same diagram. 

(ii ) Obtain the heights and masses of a larger number of people, and group 
this data in a suitable way. Then calculate the equations of the two regres¬ 
sion lines, and compare with the results of part (i). 

2 An experiment was conducted to discover the effect of adding a certain 
compound to the diet of mice. The results of the experiment are: 
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No. of units of 
compound added (x) 

1 

2 

3 

4 5 6 

7 8 

Mean gain in mass of 
mice (y)/mg 

8.7 

10.9 

9.2 

10.9 11.6 11.1 

12.3 13.8 


Find the equation of the line of regression of y on x. 

3 In the following table W g is the mass of a certain chemical substance 
which dissolved in water at T °C: 


T/° C 

10 

20 

30 

40 

50 

60 

70 

80 

90 

W/g 

45 

46 

50 

56 

59 

63 

64 

67 

74 


Calculate the equation of the line of regression of W on T. Use this equation 
to obtain a tentative value for W when T = 56. (m.e.i.) 


4 A sample of eight families of a certain monkey species was chosen in 
which each family had just a single fully grown male offspring, and the 
heights of father and son for each family are given below. 


Height of father/cm 

(x) 

67 

64 

70 

71 

65 

67 

68 

69 

Height of son/cm 

( y) 

68 

62 

70 

73 

64 

69 

71 

69 


Plot the data on a scatter diagram. Find the equation of the line of regres¬ 
sion of y on and superimpose this line on the diagram. 

5 The breadth (x) and length (y) of 12 leaves from a certain tree are given 
below: 


Bread th/mm 

39 

36 

35 

35 

30 

31 

34 

28 

33 

30 

19 

29 

Length/mm 

81 

80 

74 

89 

71 

73 

76 

75 

84 

81 

77 

82 


Plot the data on a scatter diagram. Find the equations of the regression 
lines of (i) y on x, and (ii) x on y , and draw these lines on the scatter dia¬ 
gram. 

6 A certain machine processes raw material. The purities (x) of 10 batches 
of raw material are given below along with the yield (y) from each batch. 


Purity (x) 

0.49 

0.41 

0.53 

0.46 

0.47 

0.39 

0.51 

0.42 

0.49 

0.52 

Yield (y) 

24 

21 

26 

25 

20 

22 

25 

23 

25 

27 


Plot the data on a scatter diagram. Find the equation of the line of regression 
of y on x, and superimpose this line on the diagram. 

7 The chronological ages and the reading ages (in years and months) of 
10 children from a primary school are given on next page: 
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Chronological age 

7.5 

7.11 8.1 

8.2 

8.8 9.0 

9.4 

9.10 

10.2 

10.6 

Reading age 

7.8 

7.10 8.0 

7.9 

8.9 9.1 

8.8 

9.10 

12.1 

10.7 


Find the equation of the line of regression of reading age (y) on chrono¬ 
logical age (x), if x and y are measured in months. 

8 To test the effect of a new drug, twelve patients were examined before 
the drug was administered and given an initial score (/) depending on the 
severity of various symptoms. After taking the drug they were examined 
again and given a final score (F). A decrease of score represented an im¬ 
provement. The scores for the twelve patients are given in the table. 


Patient 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

0 Initial (I) 

Score . / T7 , 

Final (F) 

61 

23 

8 

14 

42 

34 

32 

31 

41 

25 

20 

50 

49 

12 

3 

4 

28 

27 

20 

20 

34 

15 

16 

40 


Calculate the equation of the line of regression of F on I. 

On the average, what improvement would you expect for a patient 
whose initial score was 30? (m.e.i.) 

9 The following table gives the results of 4 estimations of a variable y at 
each of four values of a variable x. Calculate the equation of the regression 
line of y on x. 


X 

y 

2 

17 

15 

20 

18 

4 

15 

14 

16 

15 

6 

12 

14 

10 

12 

8 

8 

10 

10 

12 


Estimate the mean value of y when x = 3. (m.e.i.) 

10 The marks of 100 pupils in English language and English literature 
examinations were graded from 1 to 9 and their results are given in the 
frequency table below: 

English language (#) 

123 456789 

1 

2 

English 3 

literature 4 

iy) 5 

6 

7 

8 

9 
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Find the equations of the lines of regression of (i) y on x, and (ii) x on y . 

11 The following table gives, in coded form, the breadths and lengths of 
1 306 human heads. 


Frequencies of heads 
X (length) 



-4 

-3 

-2 

-1 

0 

1 

2 

3 

4 

5 

Total 

-4 


2 

2 

1 

1 

2 


i 



9 

-3 

1 

5 

5 

15 

8 

4 

1 

i 



40 

-2 

2 

3 

21 

34 

48 

41 

18 

3 


1 

171 

Y -1 


4 

21 

57 

92 

110 

57 

19 

5 

1 

366 

(breadth) 0 

1 

2 

17 

53 

93 

116 

62 

27 

12 

1 

384 

1 


2 

3 

12 

39 

65 

57 

40 

12 

2 

232 

2 



1 

4 

9 

17 

24 

12 

9 

1 

77 

3 




1 

1 

5 

10 

6 


2 

25 

4 







2 




2 

Total 

4 

18 

70 

177 

291 

360 

231 

109 

38 

8 

1 306 


The length X is coded with 18.85 cm as origin and the breadth Y is coded 
with 15.15 cm as origin. The interval of grouping is 0.4 cm in both cases. 
Calculate the average length and the average breadth of the heads, and 
the equation of the line of regression of breadth on length. (m.e.i.) 

12 The assets of a certain company have grown over the period 1850 to 


1966 as shown in 

Year X 

the table: 

1850 1870 

1900 

1930 

i960 

1964 

^65 

1966 

Assets/£10 5 Y 

5 56 

161 

464 

1 498 

2 250 

2 527 

2 905 


Calculate the line of regression of log 10 Y on X. (m.e.i.) 

13 Six pairs of values of x and y are given in the following table: 


X 

1 2 

3 

4 

5 

6 

y 

1.5 3.1 

4.9 

7.0 

9.4 

12.2 


It is thought that these values are connected by a relation of the form 
y — ax + bx 3 . By writing Y = y/x , and X = # 2 , reduce the relation to a 
linear one and calculate: 

(i ) the means of X and Y ; 

(ii ) the variance of X ; 

(Hi) the covariance of X and Y ; 

(iv) estimated values of a and b on the assumption that there is a linear 
relationship between X and Y. (o.c.) 



362 PROBABILITY DISTRIBUTIONS AND STATISTICS 

The correlation coefficient 

Let us consider further the general line of regression of y on x: 

y - y = % (* - *)■ 

If we divide through by s y9 the standard deviation of the y values, the 
equation can be written 

y - y = ixy_ x % - x 

Sy S x Sy $x 

Now in this equation (y - y)/s y and (x - x)/s x are standardised versions 
of the variates x and #, and the quantity s xy /s x s y , which is symmetrical in x 
and #, is known as the coefficient of correlation of x and y ; or, more exactly, the 
product-moment correlation coefficient . 

The correlation coefficient is normally denoted by the letter r, and if the 
standardised variates are written as x and the regression lines of# on x 
and x on y become: 

y f = rx' and x' = ry respectively. 



The limits on the value of r can now be determined, using a result known 
as Cauchy's inequality : 

If a { , b { are realmunbers (for i — 1,2 

then (£ aft) 2 < (£ a f) (L b ?)- 

[For consider the expression Z (Aa f + ^) 2 , where A is any real number. 

Since each term of the summation is squared, and hence positive, it follows 
that 

£ (Aflj + bi) 2 ^ 0 for all A 

=> ^ (A 2 a 2 + 2 Xafi + b 2 ) 5 s 0 

=> A 2 £ <z 2 + 2A £ + X > 0 for a11 

Now we know that in general if ax 2 4 - 2bx + c ^ 0 for all x, then b 2 ^ 

So if we regard A 2 £ a? + 2A £ + X as a quadratic function in 

A, it follows that 

(X ^,) 2 < (X a f ) (X *?)•] 

Now for the correlation coefficient, r, 

2 _ 4 _ n:(xt-s)(9t-9)T 

r s 2 s 2 Z (*, - X) 2 Z (y, - £) 2 

So, using Cauchy’s inequality with (x t — x) — a t and (y t — y) = b t , 
see that r 2 ^ 1 . 


we 
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Hence — 1 < r < 1. 
Summarising, 


r = ^2- and 


1 r ^ 1 


Let us now look at three particular values of r : 

(i) r = 1: In this case, the regression lines of y on x and x on y coincide, 

and if standardised they have gradient 1; this means that 
all the points of the scatter diagram are collinear, and we 
say that there is exact positive (or direct) correlation between 
the variates. 

(ii) r = 0: The regression line of y on x is horizontal, and that of x on y 

is vertical. This indicates a complete lack of linear associa¬ 
tion between the variates, which are said to be uncorrelated. 

(in) r = — 1: The regression lines coincide, and if standardised have 
gradient — 1; small values of x correspond to large values 
of y and vice-versa, all points being collinear. In this case 
we say that there is exact negative (or inverse) correlation 
between the variates. 


Significance of the correlation coefficient 

If, in a particular example we obtain a value of r close to 1 or a value which 
is nearly zero, then little further analysis is necessary; but how do we inter¬ 
pret a correlation coefficient of (say) 0.5? We wish to know its significance, 
that is the probability of obtaining such a high value if the variates are 
uncorrelated (i.e., with the null hypothesis that r = 0 for the background 
population). This will depend on the size of the sample from which r is cal¬ 
culated : the more data we use, the more confident we can feel about the 
value of r. The mathematics involved in the relevant significance test is too 
involved for this text, but we can (without at present being concerned about 
the mathematical justification) use a graph like that overleaf. This shows 
the size of samples required to give 5%, 1%, and 0.1% significance levels 
for values of the correlation coefficient r (using a one-tailed significance 
test). 

So, using the graph, we can say that a correlation coefficient of 0.5 
obtained from a bivariate sample of 15 pairs is significantly different from 
zero at the 5% level, but if obtained from a sample of only 10 pairs is not 
significant. Similarly a value of r = 0.8, obtained from a sample of size 9, 
is significant at the 1% level, but not at the 0.1% level. 
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Warning 

A significant correlation coefficient does not necessarily imply causal con¬ 
nection between two quantities. There is, for instance, a correlation 
coefficient of 0.75, which is significant at the 5% level, between the number 
of indictable crimes committed and life expectation in Great Britain 
between 1900 and 1960 (measured at 10-yearly intervals). But we would 
hardly accept that an extended life-span increased a person’s criminal 
tendencies; nor even vice-versa! 


Example 2 

Calculate the product-moment correlation coefficient for the mathematics 
and French marks given at the beginning of the section, and investigate the 
significance of this coefficient. 


We have already calculated 

s xy 

= 25.90 

and si = 48.23 

=> s x 

= 6.94. 

Now for the data, 

Z yf 

= 8 159 

=> 

Tyf 

= 407.95 


n 


=> 


= 407.95 



= 407.95 



= 21.83 

=> 

Sy 

= 4.67. 


(19.65) 2 

386.12 



Therefore 


— xy — 
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25.90 


6.94 x 4.67 


= 0.80. 


Now this value of r was calculated from 20 pairs of data, so using the sig¬ 
nificance graph we can see that this correlation coefficient is definitely sig¬ 
nificant at the 0.1% level. This is therefore extremely strong evidence of 
correlation between the mathematics and French marks in the population 
from which the sample was drawn. 


Example 3 

The heights and masses of a random sample of 10 first-formers from Watford 
Grammar School are given below. 


Height/cm 

156 

151 

152 

160 

146 

157 

149 

142 

158 

141 

Mass/kg 

47 

38 

44 

55 

46 

39 

45 

30 

45 

32 


On the basis of this sample, how significant is the correlation between 
height and mass? 

Calculation will be considerably simplified if we subtract 150 cm from 
each height and 40 kg from each mass. This is equivalent to translating the 
axes of the scatter diagram, so does not affect the value of the correlation 
coefficient. The data therefore becomes: 


Height (*) 

6 1 2 10 -4 7 -1 -8 8 -9 

Weight (y) 

7 -2 4 15 6 -1 5 -10 5 -8 


and the calculations are set out in tabular form: 


X 

y 

x 2 

y 2 

xy 

6 

1 

36 

49 

42 

1 

-2 

1 

4 

-2 

2 

4 

4 

16 

8 

10 

15 

100 

225 

150 

-4 

6 

16 

36 

-24 

7 

-1 

49 

1 

-7 

-1 

5 

1 

25 

-5 

-8 

-10 

64 

100 

80 

8 

5 

64 

25 

40 

-9 

-8 

81 

64 

72 

12 

21 

416 

545 

354 


Therefore x = 1.2 and y — 2.1. 
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So, working correct to 3 significant figures 

4 = 41.6 - 1.2 2 , s 2 y = 54.5 - 2.1 2 , s xy = 35.4 — 1.2 x 2.1 

= 40.2 = 50.1 = 32.9. 

=> S x = 6.34, = 7.08, 


Hence 


32.9 

r ~ 6.34 x 7.08 


0.73, 


and using the significance graph, we find that this correlation coefficient is 
significant at the 1% level. 


Exercise 12.7b 

1-6 Calculate the correlation coefficients for the data in exercise 12.7a, 
nos. 1-6, and comment on the significance of the coefficient in each case. 

7 Calculate the coefficient of correlation between the mass of the heart 
and mass of the liver in mice, using the following data. 


Mass of organ 


Heart/10 2 g 

20 

16 

20 

21 

26 

24 

18 

18 

Liver/10 -2 g 

230 

126 

203 

241 

159 

230 

140 

242 


What do you conclude from this analysis? (m.e.i.) 

8 The following table gives the number of goals scored at home and away 
by ten teams in the First Division of the Football League, part way through 
the 1972-3 season: 


Home 

25 

20 

12 

15 

14 

24 

20 13 14 10 

Away 

10 

14 

12 

13 

6 

10 

8 5 *6 7 


Calculate the product-moment correlation coefficient for home and away 
goals. 

9 The intelligence quotients of 8 students were assessed, and the time (in 
minutes) it took them to complete a certain piece of work was measured. 
Their I.Qs. and times are given below: 


I.Q. 

121 

118 

132 

141 

140 

137 

124 

130 

Time 

28 

26 

23 

20 

16 

17 

22 

19 


Is the negative correlation between these significant? 

10 Calculate the correlation coefficient for the data of exercise 12.7a, 
no. 10. 

11 Calculate the correlation coefficient for the data of exercise 12.7a, 


no. 11. 
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12 Calculate the product-moment correlation coefficient for the bivariate 
data given in the following frequency table: 


N. X 

y\ 

-3 

-2 

-i 

0 

1 

2 

3 

4 

-2 

4 

6 

2 

1 





-1 

1 

8 

4 

3 


2 



0 


2 

6 

7 

2 

3 

2 

1 

1 


1 

4 

3 

9 

4 

1 

1 

2 


2 


1 

6 

6 

3 

5 


13 

Ten sets of 

readings for the variates x, y, and 

z are given below, 

X 

2 

3 

5 

7 8 

11 

13 16 

18 

19 

y 

8 

5 

12 

11 22 

9 

24 17 

12 

25 

z 

8 

10 

10 

7 12 

14 

15 21 

28 

26 


Calculate the coefficients of correlation between (i) x and y , (ii) y and z, 
and (iii) z and x, and investigate the significance of each. 


Rank correlation: Spearman T rank correlation coefficient 


Six athletes competed in both a 1 500 m race and a 400 m race, and finished 
in the following positions: 


Athlete 

A 

B 

C 

D 

E 

F 

1 500 m 

1 

2 

3 

4 

5 

6 

400 m 

2 

3 

5 

1 

4 

6 


Can anything be said about correlation of performance at these two 
distances? 


If the times for the athletes were given, then the product-moment correla¬ 
tion coefficient for these times could be calculated, as in previous examples. 
Here, however, the only information is about position or rank. Therefore 
the product-moment correlation coefficient of the ranks is calculated, and 
is referred to as the coefficient of rank correlation. 

Since ranks (or positions) are less informative than actual measurements, 
the rank coefficient is correspondingly less informative than the product- 
moment coefficient. However, since only integers are involved, the rank 
coefficient does have the advantage of being easily calculated. It is evaluated 
by means of the formula: 


P = 


614 2 

n(n 2 — 1) 


where n is the total number of ranks involved, and d t the z th rank difference. 
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The derivation of this formula follows the worked examples. 

This rank correlation coefficient was first defined by Spearman in 1906, 
and until recently has been the most widely used. But there are others, so it 
is usually called Spearman's rank correlation coefficient and its significance 
graph is given below: 



Returning to the data for the two races, p is evaluated as follows: 

n — 6 


I^ 2 = 16. 

6x16 

Hence p = 1-- 

P 6(36 - 1) 

= 1 _ 16 
1 35 

_ 11 
— 35 

= 0.54; 

which, using the significance graph, is not significantly different from zero 
at the 5% level. 

Example 4 

What is the rank correlation coefficient for the heights and masses of 
schoolboys given in example 3, and how significant is this coefficient? 
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The first step is to rank (in ascending order) the heights and masses. But 
the problem arises of how to deal with the two equal masses of 45 kg; 
between them, they must occupy the 6th and 7th positions, so by con¬ 
vention each is given the rank 6.5. (It is not difficult to envisage how this is 
extended to more than two equal ranks, each being given the average of 
the ranks they replace.) The heights and masses are now ranked in the 
following table: 


Height/cm 

Mass/kg 

Height rank 

Mass rank 

d 

d 2 

156 

47 

7 

9 

2 

4 

151 

38 

5 

3 

2 

4 

152 

44 

6 

5 

1 

1 

160 

55 

10 

10 

0 

0 

146 

46 

3 

8 

5 

25 

157 

39 

8 

4 

4 

16 

149 

45 

4 

6.5 

2.5 

6.25 

142 

30 

2 

1 

1 

1 

158 

45 

9 

6.5 

2.5 

6.25 

141 

32 

1 

2 

1 

1 






64.5 

So n = 10 

and Yj d 2 

= 64.5 




=> P = 1 - 

6 x 64.5 

10 X 99 






= 1 - 0.39 
= 0.61; 

and using the significance graph, we find that this coefficient is significant 
at the 5% level. (As compared with the product-moment coefficient which 
was significant at the 1% level.) 


Derivation of the formula for Spearman’s rank correlation coefficient 

The product-moment correlation coefficient is required where the two 
variates, x and are ranks, each taking values 1, 2, ..., n in some order 
or other. 

We shall assume the standard results (see chapter 6) 

£ i = 2 n (n + 1) 

1 

£ i 2 = £«(« + 1) (2w + 1). 

1 

Since we need to calculate p = s xy /s x s y , it is necessary first of all to evaluate 
s x , s y and s xy . 
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Now £ X, = Yj yi = Z i = + !) 

1 1 1 

=> x = y = \{n + 1). 


Also YA = Y J y 2 i =Y J i 2 = in(n + 1)(2 n + 1) 

1 1 1 

=> 4 = - Z A - * 2 

n ~ 

= £(« + 1)(2«+ 1) -i(»+ l) 2 

= + l)(4n + 2 — 3n — 3) 

= + l)(n - 1) = iV(« 2 - 1) 

=> = Vfe(" 2 - !)]> and similarly s y = ~ 1)] 

=> v, = T2(« 2 - !)• 


We now use the identity: 

x&i = jxf + \y\ - £(*, - yi ) 2 

=> = iZ*? + 2 Zy? - £Z (*i - yi ) 2 

= £n(» + 1) (2n + 1) - { Z A 

=> ^Z*tfi = i(» + i)( 2 " + i) _ 
now ^ = 1 Z - ~*y 

y d 2 

=> ^ = i(« + l)(2n + 1) - ^ - i(» + l) 2 


Therefore p = - x - 
Vy 

__ ~ ^ ~ ^ u i | _ y "i 

Un 2 ~ 1) ” n(, 2 -l)* 

It should be noted that we have assumed in these calculations that the 
ranks are all different. If, however, equal ranks occur, £ x f and/or £ yf 
are no longer equal to \n(n + 1)(2 n + 1), and the above formula is no 
longer strictly valid; but the error involved is'generally very small. 



Kendall 9 s rank correlation coefficient 

Another coefficient of rank correlation, known as Kendall’s rank correla¬ 
tion coefficient (t), is derived quite differently, but has the same range of 
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values ( — 1 to 1) as the other correlation coefficients, and values of t 
equal to —1,0, and 1 have the same meaning as r and p. It is a measure 
based on a simple principle of agreements and disagreements between the 
orders in which the objects are placed. 

If we have a total of n objects (A, B, . . .) ranked in two different ways, 

= 7T n(n — 1) different pairs, such as A and B. 

Now for the two rankings, the relative positions (A before B or B before A) 
can agree or disagree. So the maximum number of agreements, and the 
maximum number of disagreements of these pairs are both equal to 
%n(n — 1). Kendall’s coefficient is calculated by subtracting the total 
number of such disagreements from the total number of agreements, and 
then dividing by the maximum possible number of agreements, i.e., 

no. of agreements — no. of disagreements S 

\n(n — 1) \n(n — 1) 


Clearly if both rankings are exactly the same, all pairs will agree in 
relative position, so that the number of agreements will be ^n(n — 1) and 
the number of disagreements will be zero; hence t will equal 1. Similarly 
for total disagreement t will equal — 1. 

The significance graph for t is given below: 


then we can choose 



At first glance it would seem a very lengthy procedure to calculate r, 
but in practice it can be calculated very quickly and conveniently, by 
either of the two methods in the following example. 






372 PROBABILITY DISTRIBUTIONS AND STATISTICS 

Example 5 

In a brass-band competition, two judges place the eight bands in the 
following order: 


Band 

A 

B 

c 

D 

E 

F 

G 

H 

1st judge 

3 

6 

7 

2 

4 

8 

1 

5 

2nd judge 

5 

7 

4 

1 

3 

8 

2 

6 


Calculate Kendall’s rank correlation coefficient for the two judgements. 


Method 1 

Re-order the bands so that they are in the correct order of merit according 
to the first judge. Take the rank given by the second judge to the first 
band, G, that is 2. Now count +1 for every number to the right which is 
greater than 2 and — 1 for each number which is less than 2; these totals 
are given below the rank. Repeat this procedure for the next rank in the 
second row, remembering to count only numbers to the right of it. And so 
on for each rank in the second row. The positive numbers so obtained 
represent agreements, and the negative numbers disagreements, so that 
their sum gives <5, the total number of agreements less the disagreements. 

GDAEHBCF n = 8 


1 2 3 4 5 6 7 8 

2 1 5 3 6 7 4 8 


6 6 3 4 2 1 1 

-1 0-2 0 -1 -1 0 

5 6 14 10 1 <5 = 18 

c 8 18 9 

\n{n - 1) 28 14 


Method 2 

Set out the bands in the order decided by each judge. Then join the same 
letter in each ranking by a straight line. Let c denote the number of crossings. 


1st judge 


2nd judge 


G D 



D G 
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Then 5 = \n(n — 1) —2c 
4c 

=> T — 1--- 

n(n — 1) 

5. 

8 x 7 

1 _ 

1 14 

^ = 0.64. 

(The justification for this second method is left as an exercise for the reader.) 

So Kendall’s rank correlation coefficient is in this case equal to 0.64; 
which, using the graph, is found to be significant at the 5% level. 


In this case, c = 
So t — 


Example 6 


Calculate t for the race positions given earlier 

A B C D E F 

in this section. 

n = 6 

1 500 m 1 

2 

3 

4 

5 

6 


400 m 2 

3 

5 

1 

4 

6 


4 

3 

1 

2 

1 



-1 

-1 

-2 

0 

0 



3 

2 

-1 

2 

1 


d = 7 

Hence t = - 

7 


7 

— = 0 . 47 . 



1 X 

6 x 

i = 

15 





and this value of r compares with the value (already calculated) of Spear¬ 
man’s coefficient, p, equal to 0.54. 


Exercise 12.7c 

1 Six varieties of chocolate, denoted by A, B, C, D, E, F are ranked for 

taste in the following order (best first): F C A D B E. The ranking on price 
(most expensive first) is C F B A E D. Calculate a rank correlation co¬ 
efficient, and comment on the result. (c.) 

2 At a village fete the fruit cake competition was judged by the vicar and 
the local squire. They placed the entries as follows: 


Cake 

A 

B 

C 

D 

E 

F 

G 

H 

I 

j 

Vicar 

5 

1 

9 

2 

3 

8 

7 

4 

10 

6 

Squire 

3 

5 

4 

7 

10 

8 

2 

1 

6 

9 
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Calculate a rank correlation coefficient. Can you deduce anything about 
the vicar’s and squire’s tastes in fruit cake? 

3 Twelve chemistry students were each given a theory and a practical 
examination. Their positions in these two examinations were as follows: 


Theory 

1 

2 3 4 5 6 7 8 

9 

10 11 

12 

Practical 

1 

4 7 3 5 2 9 8 

10 

6 12 

11 


Calculate (i) Spearman’s and (ii) Kendall’s rank correlation coefficient 
for the data, and test the significance of each coefficient. 

4 Rank the data of exercise 12.7a, no. 4, and calculate Spearman’s rank 
correlation coefficient. 

5 Rank the data of exercise 12.7a, no. 5, and calculate Spearman’s rank 
correlation coefficient for the lengths and breadths of leaves. 

6 Rank the data of exercise 12.7a, no. 6, and calculate Spearman’s co¬ 
efficient. Is the rank correlation coefficient between parity and yield 
significant? 

7 Rank the data of exercise 12.7a, no. 7, and calculate (i) Spearman’s 
and (ii) Kendall’s rank correlation coefficient for the chronological and 
reading ages of the children. 

8 The batting and bowling averages of seven of the members of a cricket 
team are put in order as follows: 


Player 

A 

B 

C 

D 

E 

F 

G 

Batting order 

1 

2 

3 

4 

5 

6 

7 

Bowling order 

7 

4 

6 

5 

3 

1 

2 


(i) Calculate Kendall’s rank correlation coefficient between the two 
orders. 

(ii) Is this significant at the 1 % level? 

(in) If the positions of A and C in the bowling order were reversed, show 
that Kendall’s r.c.c. would be significant at the 5 % level. (s.m.p.) 

9 A, B, C, D, E was the order of merit for the five marrows entered in a 
local vegetable show. One admirer put them in the order B, A, C, D, E; 
show that he got 0.8 as a Kendall coefficient of rank correlation on com¬ 
paring his order with the official result. List all other possible orders which 
would give the same coefficient when compared with A B C D E. How 
many different orders could there be for the five marrows? How likely is 
he to obtain a coefficient as high, or higher than he did, purely by chance? 

(s.m.p.) 

10 At the end of a particular season, the goals scored by the ten teams 
forming a football league, as compared with their league positions were as 
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follows: 


Position 

1 

2 3 

4 

5 

6 

7 

8 

9 

10 

Goals scored 

34 

19 31 

24 

26 

20 

22 

18 

19 

15 


By ranking the number of goals scored (in descending order), calculate a 
rank correlation coefficient between league position and goal score. Is this 
coefficient significant? 

11 Eleven boys from the same class took part in a sponsored walk for a 
charity. The distance each walked, and the amount per kilometre that he 
earned are given below. Calculate the Kendall rank correlation coefficient 
for these two sets of figures. 


Distance/km 

25 

23 

22 

21 19 18 12 10 9 5 3 

Rate/p km - 1 

14 

16 

12 

13 11 7 9 6 10 4 8 


Comment briefly on the result. (s.m.p.) 


Miscellaneous problems 12 

1 In a game of Ludo a six has to be thrown with a die before each of a 
player’s four counters can be moved. Find an expression for the probability 
that a player will be able to move his fourth counter with his tenth throw. 

Find the probability, for a binomial distribution with parameters 
n , p and q , that r trials are required to obtain k successes. 

2 A certain device, used for measuring Earth tremors in a seismological 
station, can only maintain the required degree of sensitivity for a total of ten 
tremors, and after this is disposed of. If such tremors occur randomly, 
but averaging 4 per day, find the probability density function for the useful 
lifetime (in days) of such a device, and use this to show that the mean life¬ 
time is days. What is the variance of the lifetime? 

Find the p.d.f. if the device can be used for k tremors and tremors 
average k per day; the probability distribution with this p.d.f. is referred 
to as the gamma distribution. 

3 For a certain type of bacterium, the time x from birth to death is a 
random variable with p.d.f. (1 + x) ” 2 , (x ^ 0). A culture of such bacteria 
is routinely inspected at regular intervals of time. 

One bacterium, inspected at time t after birth, is found to be dead already. 
Find the probability that this bacterium has been dead for at least time 
kt (0 < k < 1). 

Another bacterium of the same type is alive at time t after birth, but is 
found to be dead by the end of a further time t. Find the probability that 
this bacterium had been dead for at least time kt( 0 < k < 1). (m.e.i.) 

4 When a patient A arrives at a doctor’s consulting room he will be seen 
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at once, and will occupy the doctor for 30 minutes; he is equally likely to 
arrive at any time between 2.00 and 3.30 p.m. Patient B, who comes inde¬ 
pendently, is equally likely to arrive at any time between 2.00 and 4.00 p.m. 

(i) What is the probability that B will arrive while the doctor is seeing A? 

(ii) What is B’s expected waiting time? (o.s.) 

5 In a certain large population of men, heights are distributed Normally 
about a mean of 180 cm with standard deviation 5 cm. Random samples 
are taken with three men in each sample and their heights are arranged in 
increasing order. In 1 000 such samples, approximately how many will 
have: 

(i) the middle height under 175 cm? 

(ii) the least height less than 175 cm? 

(Hi) the least height between 175 and 180 cm? (s.m.p.) 

6 The number of eggs laid by a farmyard hen in a week is a Poisson 
variable with mean A. The chance that an egg is fertile is p. Write down the 
probability that in one week n eggs are laid, and r of these are fertile. 

By summing this probability over appropriate values of n find the 
probability that r fertile eggs are collected in one week. 

Show that if this number r is given, then the distribution of (n — r) 
conditional on it is of Poisson form, and that the mean of n is r + A(\ — p). 

(c.) 

7 By means of probability generators, calculate the expected value and 
variance of the number of throws of a normal unbiased die required to 
obtain the sequence 1 2 3 4 5 6 (not necessarily consecutively). 

8 Matches are put into a box five at a time until the mass of the box and 

matches combined reaches M g when the box is said to be full. The mass 
of an individual match is Normally distributed with mean m g and stan¬ 
dard deviation a g. The mass of an empty box is Normally distributed 
with mean 5 m g and standard deviation 2 <J g. Find the value of M such 
that there is only one chance in a hundred that a full match-box contains 
fewer than 50 matches (c.s.) 

9 Each of four players is dealt 13 cards from a pack of 52 which contains 
4 aces. Player A looks at his hand and winks at his partner, Player B which 
is a pre-arranged signal that his hand has at least one ace. Player B winks 
back to show that he has at least one ace as well. Player C looks at his hand 
and sees that he has just one ace. From Player C’s point of view what is the 
probability that his partner, Player D has at least one ace if: 

(i) he saw the winks and understood their meaning; 

(ii) he knows nothing about his opponents’ signals. (c.s.) 

10 R is the number of successes in a binomial distribution with para¬ 
meters n, p; S is the number of failures. 

(i) Find the covariance of R and S. 

(ii) Use the expectation and variance of (R — S), together with a suit- 
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able approximation to the distribution of (R — S ) when p = to find X 
such that 

p(\R - S\ ^ X) = 0.05. 

(m) Calculate the probability />(|7? — 6 *| ^ 8 ) when w = 16 and p = 
to two significant figures, using the exact distribution. (m.e.i.) 

11 (z) If , T 2 , A 3 , . . ., A n are certain events, we know (from chap¬ 
ter 7) that 

+ A 2 ) = p(A x ) + ^ 2 ) “ />(^M 2 )- 
Find corresponding expressions for p(A y + A 2 + A 3 ) 
and p(A 1 -h A 2 T * * * + A n ). 

(ii) n boys go to a riotous party, each with a girl; and though the party 
ends in complete confusion, each boy also leaves with a girl. What is the 
probability that no boy leaves with the girl he brought, and to what value 
does this probability tend for a large party? 

12 Initially a machine is in good running order but it is subsequently 
liable to break down. As soon as a breakdown occurs repairs begin. If the 
machine is in good order at time t then the probability that a breakdown 
occurs in a small interval (t, t + St) is a St, and if it is under repair at 
time t the probability that the repair is completed in time (t, t + St) is 
jS St. Let p(t) be the probability that the machine is under repair at time t. 

Write down an equation relating p(t + St) to p(t) and hence show that 
p(t) is a/(a + p) {1 — e' {a+p)t }. (c.s.) 

13 Two jars, one white and the other black contain a + b balls each; in 
the white jar there are a white and b black balls and in the black jar b white 
and a black. Single draws are made as follows: at the rth draw a ball is 
drawn from the white or black jar according as the (r — 1 ) th ball drawn 
was white or black, the colour of the ball noted, and then returned. If p n is 
the probability that the nXh ball is white, show that 

(a + b)p n = b + (a - *)/>„_!. 

By means of the substitution 

Pn = i + 

or otherwise, determine , when the jar from which the first draw is made 
is (i) chosen at random, (ii) white. (o.s.) 

14 Three deadly enemies A, B, and C take part in a three-cornered duel 
with pistols. The probability of A hitting his target is 0.6, that of B 0.8, 
and C 1 . They draw lots for the order of firing and each uses his best 
strategy (they all know one another’s capabilities). If they take it in turns 
to keep on firing until just one is left, who is most likely to survive, and what 
is his probability of survival? 
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15 In a game between two players both players have an equal chance of 
winning each point. The game continues until one player has scored N 
points. Find the probability p r that the winning player has a lead of ex¬ 
actly r points when the game is complete. Deduce that 

(2 N-r - l)A + i = 2 (N-r)p, (r = 1,2 

and hence find the expected value of the lead at the end of the game. 

(c.s.) 





Matrices, determinants, and 
linear algebra 


13.i Introduction: matrices and matrix addition 

Matrices as stores of numerical information 

Suppose the boys in a school are placed in 4 sets ’ for mathematics according 
to ability, and that there are four sets in the first and second forms and five 
sets thereafter. We use the notation III 2 , for example, to denote the second 
set of the third forms. If conditions were 'ideal 5 or extremely rigid!, all 
the boys in I x would move up the following year into set IIj; all those in 
IV 3 would go into V 3 , and so on. But of course, in practice, things are not 
like that: if they were, life would be very dull! Some make unexpected 
progress and earn promotion to higher sets; others do the reverse. Some 
develop a dislike of the subject, whilst others acquire a fascination for it. 
So it may be that, of the 31 boys in IV 2 , 4 go into V x the next year, 20 into 
V 2 , 5 into V 3 and 2 into V 4 , none into V 5 . Suppose we wished to give an 
account of the flow of boys through the school. One way would be to 
describe the composition of the various sets in words: 4 of-the 31 boys in II l , 
18 went into IIIj, 6 into II 2 , 4 into III 3 , none into III 4 and 1 into the 
lowest set, III 5 , while d boys left to join other schools, and so on.. 

One diagrammatic way of representing this data might be as follows: 


II t II 2 II 3 II 4 New 
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A similar diagram showing the ‘flow 5 between I’s and IFs, IIFs and IV’s, 
and between IV’s and V’s would give some sort of a picture of the 4 flow ’ 
of boys through the school. Though this may be an improvement on the 
dry prose of the descriptive paragraph, in that one can ‘ see ’ the movement 
by following the various arrows, it may be objected that the representation 
is somewhat lacking in clarity 1 

What is the best way of depicting this information? Surely by means of a 
two-dimensional array, or table, or matrix , as follows: 

From 

_ A 

II, II 2 II 3 II 4 New 

' HI, 18 2 7 2 0 

III 2 6 10 8 3 2 

III, 4 9 8 6 0 

To ■< ° 

1 III 4 0 4 11 10 0 

III 5 1 1 1 12 1 

Leave 2 0 10- 

This is usually written 

18 2 7 2 0 

6 10 8 3 2 

4 9 8 6 0 

0 4 11 10 0 

1 1 1 12 1 

2 0 10 0 

and, as it has six rows and five columns, is called a 6 x 5 matrix. 

This method of displaying the data can scarcely be improved. Each row 
of the matrix shows clearly the composition of the new Ill’s sets, e.g., that 
set III 4 consists of 4 boys who were in last year’s II 2 , 11 from II 3 , 10 from 
II 4 , and 1 new boy. The row, or Tist’, (0, 4, 11, 10, 1) may, in fact be 
regarded as a row vector , and the 6x5 matrix as a collection of six such 
row vectors. Similarly the columns show the distribution of the boys in the 
old II’s, the final column vector 

0\ 

2 

0 

1 

1 

0, 
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is a list indicating the disposition of the four new boys between the five sets 
(the final entry may be deemed to be zero, since a new boy is hardly likely 
to leave before even arriving — though it has been known!). 

A large school with an intake of, say, 300 pupils per year would require 
larger matrices (say 12 x 12) to describe the composition of the new sets. 
A small school with 3 sets per year may possibly have a policy of moving 
boys only in exceptional circumstances, and it could be that a ‘ promotion 
matrix ’ for one level would read: 

( 28 1 0\ 

2 25 1 j. 

0 0 27/ 

Notice that all such matrices are nothing more than stores of numerical infor¬ 
mation (or just tables, if you like) and you can imagine how such systems of 
matrices are useful for describing the stages in an industrial process. 


Matrix addition 

We can now show how matrix addition may be defined. Suppose five boys, 
Don, Joe, Ken, Tom and Vic tabulated their marks in maths (M), physics 
(P), chemistry (C), and biology (B) during each of two exams, as follows: 



M 

1st exam 
P C 

B 

M 

2nd exam 
P C 

B 

Boy D 

55 

42 

71 

60 

70 

56 

82 

57 

J 

35 

62 

29 

73 

46 

58 

34 

68 

K 

69 

48 

40 

57 

73 

55 

52 

78 

T 

24 

38 

47 

49 

29 

32 

47 

58 

V 

70 

68 

67 

85 

76 

63 

61 

83 


Then the following table would give their aggregate scores (out of a 
total 200) 



M 

P 

c 

B 

Boy D 

125 

98 

153 

117 

J 

81 

120 

63 

141 

K 

142 

103 

92 

135 

T 

53 

70 

94 

107 

V 

146 

131 

128 

168 
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We may write this in matrix form as the addition of the two 5x4 matrices 
to give a single 5x4 matrix, thus: 


1 55 

42 

71 

60' 


70 

56 

82 

57 


125 

98 

153 

117 1 

35 

62 

29 

73 


46 

58 

34 

68 


81 

120 

63 

141 

69 

48 

40 

57 

+ 

73 

55 

52 

78 

= 

142 

103 

92 

135 

24 

38 

47 

49 


29 

32 

47 

58 


53 

70 

94 

107 


68 

67 

85 

1 


76 

63 

61 

83 


146 

131 

128 

168 


More generally, we may define the sum of two p x q matrices, where 
essentially each has the same number of rows (p) and the same number of 
columns (q), by asserting that the elements in the zth row and jth column 
of the two given matrices are added to give the element of the zth row and 
jth column of the sum matrix. Again, for example, when p = 2, q = 5, 
we have 

CL j CIq ^3 

b\ b 2 b 3 

_ ( a \ + c \ 

Vh + d 1 

Similarly, as in the previous example, one may possibly wish to add a 
number of‘promotion matrices’; if, for instance, one wished to find the 
aggregate movement of pupils from II’s to Ill’s over a period of ten years, 
one could add those ten 6x5 matrices to give a single 6 x 5 matrix. 

An ordinary football league table may also be regarded as a matrix, the 
rows corresponding to the various teams A, B, C,. . . , and the columns 
giving the number of games played (P), won (W), drawn (D), lost (L), 
goals for (F), against (A), and total points (T): 


a. 


4 a 5 

b A b 


+ 


T 


d 1 d 2 W 3 u 4 


c 4 

d A 


v 4 u 5 

a 2 + c 2 a 3 + c 3 a 4 + c 4 a 5 4- c 5 
b 2 + d 2 b 3 + d 3 b 4 -f d 4 b 3 + d 5 


p 

W 

D 

L 

F 

A 

T 

12 

9 

2 

1 

31 

10 

20 

11 

8 

3 

0 

28 

12 

19 

12 

8 

1 

3 

30 

15 

17 


The result of a particular week’s play might be represented as follows: 

0 1 0 3 3 r 

10 0 1 14 0 

0000000 




and the table 


UCTION : 

MA' 

rRic 

ES . 

AND 

'13 

9 

3 

1 

34 

13 

2l'' 

12 

8 

3 

1 

29 

16 

19 

12 

8 

1 

3 

30 

15 

17 
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informing us of the final state of the teams, is derived from the first two by 
matrix addition. 

Note that each row of the matrix is a list representing the performance of 
one particular team; while the column vectors are lists providing statistics 
about one aspect of all the teams’ performances, e.g., column (D) gives 
the whole story of the league from the point of view of matches drawn. 


Exercise 13.1a 

1 The following is a matrix showing the expenditure in £ of a house on 
types of fuel during consecutive quarters in 1973: 


Quarter 

Oil 

Electric 

Solid fuel 

1 

40 

15 

12 

2 

28 

9 

7 

3 

21 

11 

2 

4 

37 

8 

9 


The corresponding matrices for 1974 and 1975 are: 



and 



Sum the three matrices and interpret. 


2 Five bowlers had the following figures at the end of three test matches: 


| Of course, if the final table is also to show the teams in order of merit, there will usually 
have to be a reshuffling of the rows, and this could be effected by ‘ pre-multiplying’ by a 
so-called ‘permutation matrix''. 
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Bowler 

Overs 

Maidens 

Runs 

Wickets 

A 

no 

26 

213 

15 

B 

89 

11 

305 

18 

C 

50 

8 

110 

7 

D 

25 

2 

64 

5 

E 

26 

10 

33 

3 


The performance at the end of the series of five matches was given by the 
5x4 matrix: 


213 

41 

418 

32 

89 

11 

305 

18 

80 

12 

198 

11 

25 

2 

64 

5 

37 

14 

90 

8 


Set up a matrix showing the performance of the five bowlers over the last 
two g^mes. 


3 The matrix below shows the readings of barometer, thermometer, rain¬ 
fall, windspeed and wind direction at three stations A, B, C: 



Barometer 

Thermometer 

Rainfall 

Wind Speed 

Direction 

A 

1 021 

24 

13 

14 

280 

B 

1 053 

29 

0 

29 

350 

C 

998 

18 

35 

40 

085 


Changes in the next 24 hours are recorded in the following matrix: 

( +20 -2 +6 +12 -15 

+ 8 -7 +21 -11 +20 

-11 0 0 -22 +5 

Record the final state of the weather at A, B, C as a 3 x 5 matrix. 

4 If A, B, C are p x q matrices, and + denotes matrix addition, prove 
that 

(A + B) + C = A + (B + C) 

(the associative law of addition for matrices). 
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5 A is a.p x q matrix. Define — A, the negative of A, as the matrix A with 
every element multiplied by —1. Proceed to define matrix subtraction, 
and evaluate 



Linear dependence and independence: preliminary discussion 

It is worth remarking here that some of these column vectors are not inde¬ 
pendent of each other. For example, in the above table of football results: 

col (P) = col. (W) + col. (D) + col. (L) 



Using the notation/? to represent column vector (P), we might write: 

p = w + d + l or / = p — w — d, etc. (1) 

The fact that the column vectors are not independent means that there is a 
sense in which one of them is redundant — if we know any three of/?, w , d , 
/, then the other may be found from the vector equation (1). Again, we 
have t — 2 w + d , the three columns (T), (W), (D) are said to be ‘linearly 
dependent’. 

It may be observed that columns (F) and (A) are not independent, since 
their totals must be equal. Even so, it would not be correct to describe 
them as a pair of linearly dependent columns. This would only happen when 
a linear relation, of similar form to (1), connected the columns, and this must 
mean, in the case of two columns, that one would have to be a multiple of 
the other. For example, if there were an extra column for ‘points from 
matches won ’: 



then clearly x = 2 w, the two columns having corresponding entries in the 
fixed ratio 2 : 1 . 

Again, suppose there was a column (G) for ‘goal average’ 




386 MATRICES, DETERMINANTS, AND LINEAR ALGEBRA 

= goals for/goals against: 



This column depends very much on columns (F) and (A), but it is by no 
means linearly dependent upon them: we cannot find values of a and /? 
such thatg = of + /?a. 

Indeed, even the first three equations 

3.1 = 31a + 10/? 

2{ = 28a + \2P 

2 = 30a + 15/? 

are inconsistent. 

The relations that do exist are g x = fja 1 , g 2 = / 2 /a 2 , etc., but these 
are not what is needed for linear dependence. 

Note that ‘vector addition’ such as we have encountered in this para¬ 
graph occurs in many other contexts. For example, in adding polynomials, 
we are carrying out a similar process in adding like terms: 

3x 3 - 2x 2 - 11 

Add 5x 2 — lx — 4 


3* 3 + 3x 2 — 7x — 15 


This might be written 

(3 -2 0 -11) + (0 5 -7 -4) = (3 3 -7 -15), 

each polynomial being represented by a row vector or ‘list’ of coefficients. 

The idea of linear independence and linear dependence also arises when 
we consider the solution of linear equations. The pair of equations: 

2x - 5 y - 4 = 0 
- 10 * + 25 # + 20 = 0 

is a perfectly good pair of simultaneous equations, but they do not have a 
unique solution. They have an infinity of solutions for the simple reason 
that the second is simply a multiple of the first, and is in effect redundant, 
the matrix of coefficients 


2 -5 -4 

-10 25 20 


consisting of two linearly dependent rows . 
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On the other hand, the equally respectable pair of linear equations 

2x — by — 4 = 0 
-10* + 25y + 9 = 0 

do not have any solution for the unknowns x, y. Here the first two columns 
and ^ 2 ^ are linearly dependent, but not the rows. We shall 
return to a fuller consideration of this (see pp. 442, 445). 



Exercise 13.1b 

1 Three batsmen have the following analyses. Fill in the blanks, and state 
which rows and which columns are linearly dependent. 



Innings 

Not out 

Runs 

100s 

Completed 

innings 

Average 

A 

24 

4 

1 050 

3 

20 

51.0 

B 

32 

0 

1 507 

5 



C 


9 

967 

5 

19 



2 A boy records the number of ice creams, buns and packets of crisps in 
columns a, c, and e during four successive weeks, and the money spent on 
them (prices 3p, 5p, and 2p respectively) in columns b, d, f. The total 
expenditure in p is shown in g, while h shows the change from £\. 


Week 

Ice-cream 
No. Cost 
a b 

Buns 

No. Cost 
c d 

Crisps 

No. Cost 
e f 

Total 

g 

Change 

h 

1 

4 

12 

8 

40 

11 

22 

74 

26 

2 

0 

0 

10 

50 

9 

18 

68 

32 

3 

5 

15 

3 

15 

5 

10 

40 

60 

4 

3 

9 

4 

20 

0 

0 

29 

71 

Totals 

12 

36 

25 

125 

25 

50 

211 

189 


State which of the columns and which of the rows are linearly dependent 
(taking them in pairs, in threes, etc.). 

3 Decide which of the following row vectors are linearly dependent (in 
twos and in threes): 

« = (2, 5, -3) 

d= (-2, 5, 3) 


b = (6, 15, -9) 

* = (6, -5, -9) 


c = (4, -10, -6) 
/= (0, 30,0) 
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( 2 3 \ \ 

) decide which columns are linearly 
5 6-2 ) 

dependent. Replace one element in row 2 so that the rows may be linearly 
dependent. 


/ 2 

3 

1 

-4 \ 

-1 

2 

0 

2 

\ 3 

1 

1 

- 6 / 


for linear dependence of rows and of columns in pairs and in threes. 

6 Find x so that the three rows may be linearly dependent: 

/ — 3 4 0\ 

8 5 ~ 2 
\ x —1 6 / 

Are the columns then linearly dependent or not? 


7 The atomic weights of C, O, and H are 12, 16 and 1 respectively. The 
matrix shows the numbers of atoms of each element in various compounds, 
and the molecular weight in the final column. 



c 

H 

O 


Water 

0 

2 

1 

18 

Methane 

1 

4 

0 

16 

Benzene 

6 

6 

0 

78 

Alcohol 

2 

6 

1 

46 

Sugar 

12 

22 

11 

352 

Acetic acid 

2 

4 

2 

60 


Show that the row vectors for water, sugar and acetic acid are linearly 
dependent. 


13.2 Linear transformations and their matrices 

Transformations in two dimensions 

It will be remembered from our discussion in the prologue to Book 1 that 
linear transformations! in the plane were governed by equations of the form 

x' = ax + by 
y = cx + dy 


| Confining ourselves to those in which the origin is invariant (i.e., excluding translations). 
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which we expressed as a matrix equation 

a b\ l x] 
c d) 

These may be written more briefly as x' = Mx, where x , x f are the vectors 

(or 2 x 1 matrices, with 2 rows and 1 column), 

a b 
c d 


and M is the 2x2 matrix 


The effect of this transformation is to take the grid of unit squares 

in the x, y plane into a grid of parallelograms in the x\ y' plane, so that the 

unit square OUKV transforms into the parallelogram OU'K'V', 




where 



V' is 


and K' is 


a + b 
c T d 


the area of the basic parallelogram cell being ad — be. (In the ‘singular 
case’, when ad — be = 0, the network of parallelograms collapses into a 
straight line.) The salient results are that the first column of the 2x2 matrix, 



, gives the transform U' of the point U 




while the second column 



gives the coordinates of V', the transform of 


V 


i/’ since w d 



Using this important property we are able to write down at sight the 
2x2 matrices which represent such transformations as rotations about 
the origin, reflections in a line through O, shears, enlargements, and so on. 

For example, suppose we wish to reflect in the line y = x tan 0. We 
see that the image of U is the point with coordinates (cos 20, sin 20). It is 
not quite so easy to ‘read off’ the coordinates of V', but remembering that 
OU' _L OV', it is clear that V' is the point (sin 20, —cos 20). 
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Notice that its determinant (ac — bd) is —cos 2 2d — sin 2 26, and so has 
the value — 1 : this numerical value shows that areas are preserved, while 
the minus sign signifies that the transformation is ‘opposite 5 , i.e., the 
reflection has turned the triangle OUV over, the anti-clockwise sense of 
the lettering OUV being replaced by the clockwise sense of OU'V'. 


Exercise 13.2a 

1 Write down the matrices representing the following transformations: 

(i) Rotation about O through n, ^7t, — \n, 2n/3, —2n/3, 37 r/ 4 , 

3 7t/4, tan -1 (3/4), etc. 

(ii) Reflections in the lines x = 0, y = 0 , x = y, x = —y,x~2y. 

(Hi) An enlargement in O with ratio 3:1; an x-stretch, ratio 2:1; a y- 
stretch, ratio 1 : 3 ; a ^-shear which moves the point ( 1 , 1 ) to the point ( 1 , 0 ). 


2 What transformations are represented by the following matrices: 


(0 

M 

(via 

« 


2 0 
0 3 

1 

-3 


(ii) 


(vi] 


-3 0 

0 -3 

0 -2 
-2 0 


(Hi) 

(vii) 


3 

-4 

2 

0 


cos a 
sin a 

cos 40° 
sin 40° 


sin a 

— cos a 

— sin 40°^ 
cos 40° 


(ix) 
(xi) 


— sm a cos a 
cos a sin a 

2 4 
1 2 


-1 

2 


(xii) 


(iv) 


3 Describe the transformations represented by the eight matrices 
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4 In nos. 2 and 3 above, check the effects of the transformations on areas, 
and compare the values of the determinant ad — be in each case. 

5 Obtain the matrix for rotation about the origin through 6 

( i ) by obtaining x' and y’ in terms of x and y by projection; 

(ii ) by using complex numbers, starting from the equation z r ~ z cis a ; 
(Hi) by the method shown in the text above. 


6 Obtain the matrix for reflection in the line y = x tan 0, by first rotat¬ 
ing the coordinate axes so that the x-axis coincides with the reflection axis 
(operation /?), then performing the reflection in this axis, and finally 
restoring the axes to their original position (R ~ 1 ). 


7 A triangle has vertices 




What is its image under the transformations represented by the matrices: 



(\ 0 \ 

8 Investigate the effect of the transformation represented by I ^ 1 on 


the rectangle with vertices 


9 What is the image of the point 


-2 


lM-iA 


under the matrix which moves 


to 


-2 

2 


and 


to 


10 Show that M = 

1 


2 -1 
-3 4 


operating on any vector in the direction 
/ —3^ 

1 J leaves it invariant. Show that vectors in the direction ^ ^ 

unchanged in direction, but are enlarged by a factor 5. Repeat, using the 
matrix M 2 . 


are also 


11 Find a 2 x 2 matrix which transforms the vector to the vector 

\2 


0 


and leaves [ ^ J invariant. What type of transformation is it? 


12 Show that, if a 2 x 2 matrix represents an isometry , i.e., a distance¬ 
preserving transformation, then it must be either of the form 


( cos 6 sin 6\ 

I . or else ol the form 

\ — sin u cos 6J 


( cos 0 sin 6\ 
sin 6 —cos 6 y 


i.e., a rotation or a reflection. 
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Transformations in three dimensions 


All the above can be extended to a higher number of dimensions. In three 
dimensions, for instance, we can consider the equations 

x' = a t x + b x y + c x z 
y = a 2 x + b 2 y + c 2 z 
z! = a 3 x + b 3 y + c 3 z 

as representing a transformation which maps 


the vector 


into the vector 


Just as in the two dimensional case, this linear transformation which 
preserves the origin may be completely described by a matrix, in this case 
the 3x3 matrix 



so that the transformation may be written 



or more briefly as x — Mx. 

If we label the points (1, 0, 0), (0, 1, 0) and (0, 0, 1) as U, V, W respect¬ 
ively, the transform of the point U will be given by 



i.e., the coordinates of the point U', the image of U, are the first column f of 
the matrix. Similarly, the second and third columns of the matrix give the 
coordinates of V' and W': 



■f It might therefore seem that columns are more important than rows. But this is only because 
we have chosen to represent vectors by columns. Had we used row vectors, and /^-multiplied 
by the transforming matrix, its rows would have given the images of U, V, W. 
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In this sense, the 3 x 3 matrix may be thought of as a collection of three 
column vectors a, 6, c. Conversely, if we wish to write down the nine ele¬ 
ments of the matrix which represents a particular three-dimensional 
transformation of the type described, all we have to do is to find the images 
of the three points U, V, W, and their coordinates will give the three 
columns of the required matrix. 

The cube having OU, OV and OW as adjacent edges is taken by the 
above transformation into a parallelepiped having OU', OV', OW' as adja¬ 
cent edges, and the effect of the transformation on the lattice (i.e., solid 
network) of unit cubes will be to take them into a lattice of parallelepipeds. 


Effect of linear transformations on volumes 


We have seen that the linear transformation whose matrix is 


M — | a 2 b 2 c 2 

\ a 3 ^3 L 3j 

transforms the unit vectors 



But the volume of the parallelepiped formed by a, 6, c has been shown 
(in 11 . 5 ) to be the scalar triple product 

a* ( b X c) == a^b 2 c$ ^\b<^c 2 H - b^c 2 a 2> b\C<^a 2 ~\~ c^a 2 b3 

which is called the determinant f of the Matrix M and written 

A = \M\ (or det M). 

f Just as the area ad — be of the basic parallelogram in the two-dimensional case was the 
determinant of 
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So the transformation whose matrix is M leads to volumes being altered 
by a factor |Af|. 

Determinants will be discussed more fully in 13.5 and 13 . 6 , but at this 
stage we simply note that when the determinant is zero the parallelepiped 
has zero volume, i.e., the three vectors are coplanar: 

\M\ = 0 <=> a, b, c are coplanar. 


In this case, a vector 


into a vector 



in the original space will now be transformed 




= xa + yb + zc, 


which lies in the plane of the vectors a, 6, c. So the entire original three- 
dimensional space is mapped onto a plane, a two-dimensional space; or, 
in exceptional cases, onto a line or even a point. 


Examples of three-dimensional transformations 


Let us set up the 3x3 matrices for transformations in a number of simple 
cases. 

First consider the reflection in the plane UOV (z = 0). The points 
U and V are invariant, and the point W clearly goes to W' (0, 0 , — 1). 
Hence the matrix is 




Next we consider a quarter-turn about the y- axis, the sense being such 
as to take the point W into the point U, i.e., W' is ( 1 , 0 , 0). We see that 
U' has coordinates ( 0 , 0 , —1) and since V is invariant, the required 
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We now ask what transformation is represented by the matrix 


0 

1 


,0 


0 

0 

1 




From the columns, we can read off the positions of U' (0, 1,0) = V, 
V' (0, 0, 1) = W, and W' (1, 0, 0) = U. The diagram makes it evident 
that we are concerned with a rotation through 120° about an axis passing 


through (1, 1, 1). We may check that the vector 



is invariant under this 


rotation, for 



[An important idea is suggested here, that of an eigenvector , whose direction 
is unchanged in a linear transformation.] 
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All the matrices so far considered have included only 0 and 1, but 



represents an #-stretch in the ratio 3:1, while 
3 0 0\ 

0 3 0 
0 0 3/ 

is an enlargement in the ratio 3:1, the vector OP being replaced by 
OP' = 3 OP. This matrix is known as a ‘scalar’ matrix, since its effect on 
every vector OP is the same as the effect of multiplication by the scalar 
number 3. The general scalar matrix 

( £00 
0 £ 0 
0 0 £ 


represents a pure enlargement by a factor £ when £ > 0, but when £ < 0 
it involves ‘reflection through the origin’, or ‘central inversion’, exempli¬ 
fied by the matrix 

-1 0 0\ 

0 -1 01, 

0 0 - 1 / 

whereby every vector is reversed in direction . 

Lastly, we consider a transformation in which the origin is not fixed. 
Under a transformation of the form x = Mx , the origin maps into itself. 
How can one express in matrix form a transformation which does not 
possess this property? The simplest example might be a translation repre¬ 
sented by x = x + a, where a is a fixed vector. In the two-dimensional 
case, we have 

x' = x + 0) 

y' = y + b\ 


and this may be written 





or as 
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Exercise 13.2b 

1 Write down the 3x3 matrices representing the following: 

(i) reflection in the plane y = 0; 

(ii) reflection in the plane y = x\ 

(Hi) reflection in the plane y = x tan a; 

(iv) reflection in the plane x + y + 2 = 0; 

(v) half-turn about the x-axis; 

(vi) half-turn about the line x = y, z = 0; 

(vii ) half-turn about the line x = y = z; 

( viii ) quarter-turns about 2 -axis; 

(ix) quarter-turns about the line x = 0 = y + z; 

(*) rotation through 6 about the y-axis; 

(xi) an enlargement from the origin in the ratio 5:1; 

(. xii ) an orthogonal projection on to the plane y = 0. 


2 What transformations are represented by the following matrices: 



1 0 O' 

0 1 0 
0 0 0 
/ 0.8 
0.6 
\0 


\ (° 0 

' \\ 0 

- 0.6 0 \ 
0.8 0 1 




3 Consider the effects of the above transformations on volumes and check 
by evaluating the determinants in each case. 


4 Show that the matrix 
itself. 




transforms the vector 



into 


5 Check that the vectors 
the effect of the matrix 






are coplanar, and investigate 


6 Find the effect on the cube with vertices 


±1 

+ 1 


of the matrix 
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Repeat for some of the matrices mentioned in nos. 1, 2, 5 above. 

7 Check that, if a 3 x 3 matrix M transforms the vectors x and y into 
the vectors x and y' (i.e., x — Mx and y' = My ), then the vector 
z = kx 4- \iy will be transformed into the vector z = kx -f fJy'. 


8 Show that a glide reflection in the x-axis, in two dimensions, whereby 
each point is reflected in the x-axis and then undergoes a translation 
parallel to the x-axis, may be represented by a 3 x 3 matrix. 


What does the equation 



Describe the effect of the matrices 




represent? 


in the four possible cases of selection of signs. 


13.3 Products of linear transformations and matrices 


Suppose that the general 3-dimensional vector x is transformed to give a 
vector x\ and that this in turn is transformed to give x’ . Let us further 
suppose that these transformations can be represented as 


x" = Ax' and 

X = 

Bx 

/a, 

Pi 

rA 

where A = 1 a 2 

p 2 

y 2 j 

\«3 

P 3 

y*l 


and 


x” = OL x x f + P x y' + y x z ' 
Hence y" = ot 2 x' + fi 2 y' + y 2 z' 
z" = x 3 x’ + P 3 y’ + y 3 z' 



x' = a x x + b x y + c x z 
y' = a 2 x + b 2 y + c 2 zY . 
z ' = a 3 x + b 3 y + * 3 z 


Substitution gives 

x” = a x (a x x + b x y + c x z) + fi x (a 2 x 4- b 2 y 4- c 2 z) 4- y x (a 3 x 4- b 3 y 4- c 3 z) 
= (<x x a x 4~ /?i #2 d~ y\d^)x 4" (oc 1 ^j + Pib 2 4- y\b 3 )y 
4- (ol x c x + (3 x c 2 4- yic 3 )z. 

Similarly we can write down expressions for y” and z", and the three 
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results can be combined in matrix form: 

( cn l a l + )S 1 < 2 2 + y x a 3 cn l b l + P x b 2 + 7 i^ 3 a^i + /?i ^2 + 7i^ 
a 2 «! + j?2 a 2 + 72^3 a 2^1 + i®2^2 + 7 2 ^3 a 2^1 + ^ 2 C 2 + 7 2 ^3 

«3 fl l + 03*2 + 73*3 <*3*1+03*2+73*3 <*3*1+03*2+73*3 

or #" = C#, 

where C represents the combined effect, or product , of the two transforma¬ 
tions. 

But #" = Ac' and 
can also be combined to give 
#" = A (/fa), 

so we now call C the product of A and 5 and write 
AB = C and x" = ( AB)x : 



Hence we see that the products of 3 x 3 matrices are formed just like 
those of 2 x 2 matrices (see 0 . 7 ) by ‘row and column multiplication’. 
More generally, we define the product AB of two matrices A and B in 
just the same way, the element of A# in the it h row andjth column being 
called the inner product of the zth row of A with thejth column of B. This, of 
course, necessitates that these two have the same number of elements, i.e., 
that the number of columns of A is equal to the number of rows of B , such 
matrices being called conformable for multiplication. 

Hence if A is an m x n matrix 
and B is an n x p matrix, 

AB will be an m x p matrix. 


Example 

Find AB and BA when 
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(ii) A = (1 2 3) 



(0 



/ 14 —14\ 

V—14 14/’ 

/0 6 12 \ 
0 8 16 . 
\0 10 20 / 


(ii) 


AB = (1 2 



(40 46); 


whilst BA 


/ 4 5 \ 

I 6 7 1 (1 2 3) does not exist. 

\8 9/ 


Exercise 13.3a 


1 






0 2 \ 
-3 1 )■ 


Find where possible AB , BA, CD, DC, DB , 2?D, DA, AD, CA, AC, BC, CB. 


2 Find the missing elements: 



(unit matrix) is an identity for post-multiplication of matrices with 
3 columns, and also for pre-multiplication of matrices with 3 rows, while 
MI = IM = M where M is any 3x3 matrix. Generalise. 


4 


If 





find AB and BA. Find other counter-examples to illustrate that 
AB = 0 A = 0 or B = 0, where 0 is the null matrix. Deduce a counter- 
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example to show that AC = BC =/> A = B. Obtain examples with 3x3 
matrices, taking 


/“ 2 

1 

°\ 

A =\ 1 

-3 

-0 

\ 1 

7 

3/ 


5 Represent the following five quadratic equations by a single matrix 
equation: 

2x 2 + 5* — 8 = 0; x 2 — 7x + 11 = 0; 3x 2 = 4x 4- 1; 

5 — x — x 2 = 0; 4x 2 + 3x = 0. 

6 If A is a 2 x 2 matrix such that A 2 = 0 , find the general form of A. 
Repeat when A 2 = I. 


7 Find all the matrices that can be obtained by forming products (e.g., 
ABA 2 ) from 


A = 




Repeat with 





(Note that in this question i 2 = — 1, a > 3 = 1.) 


8 Show that the only 2x2 matrices which commute with every 2x2 

, fk 0'S 

matrix are those 01 the form 1 


If A = 


a 


0 k 

B = 


l 

a 2 a \) \~^2 
find AB and BA. What does this result remind you of? 


. Extend the result to 3 x 3 matrices. 

b\ hr, 


J 2 

Kr 


10 Find the products: 



2 5-2V2Y 

5 - 272 2 


11 If A — 


find A 2 , A 3 , A 4 ,.... 


2 -3> 

,-3 5 j 

If AX = XA where X is a 2 x 2 matrix, show that X may be expressed 

(p 1 


. i r i 
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Find the general form of matrices which commute with a given matrix 
a b\ 


prove that M 2n+1 = M. 

find C 2 , C 3 , . . . and show that C 12 = /. 


12 If M = 


13 If C 


14 Express 



in the form 



15 If A, B , C are 2x2 matrices, prove that {A x B) x C — A x (B x C) 
(associative law of multiplication). Extend the proof to apply to 3 x 3 
matrices. How can the proof be generalised to apply to any three matrices 
which are conformable for multiplication? 


16 Interpret row by column multiplication of two vectors r l ,r 2 in 3-space, 
expressed by the equation 

(x 2 \ 

(*1 yi Zi) ly 2 1 = x i x 2 + y\.y 2 + *1*2- 


Show that this ‘inner product’ of the two vectors r l5 r 2 has the value 
r x r 2 cos 0 , where r x , r 2 are the lengths of the two vectors, and 9 is the angle 
between them. How may the concept of 4 angle ’ be extended into more 
than three dimensions? 


17 Express 


(x y 


( a h g\ 



as a single expression, and show how x 2 + y 2 + 2 gx + 2fy + c may be 
similarly expressed. 

Repeat with the expression 2x 2 + 4 y 2 — 6xz 4 - 3 yz. 


18 Investigate the effect of pre-multiplication of a column vector of 
dimension n by an n x n matrix containing n Is and the rest Os, each row 
and each column of which contains a single 1. Repeat for post-multiplica¬ 
tion of a row-vector (such matrices are known as 4 permutation matrices ’). 


19 Interpret the matrix equation 

/ 0 1 \ / 2 3 -1 0 \ = /—1 5 0 - 4 \ 

V-i oJv-i 5 0-4J v-2-3 1 °/ 

as a geometrical transformation applied to a certain quadrilateral. 
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Xo ... x, 


20 The equation I y\ y’ 2 . . . y' n = 


z 9 . . . z. 



may be interpreted as follows: the n points j y r I (r = 1, 2,. . n) are 


transformed by the matrix (a b c) into the n points [ |, so that 

we are thinking of each of the first and third matrices as collections of 
column vectors. Interpret in this way the equation: 

1 4 0 7 -1\ /3 0 

2 -3 2 0 — 1 | = j 1 4 

3 0-5 1 2/ \2 -3 

Successive transformations 

We may now consider the combined effect of certain successive three- 
dimensional transformations (which would be difficult to draw or even to 
visualise) by simply applying the multiplication rule for the corresponding 
matrices. For example, the products 


1 ° 

0 

l \l 


0 

0 

\o 

1 

0 /' 



and 


f \ 0 0\ / 0 1 O' 

0 0 1-1 0 0 
\0 1 0 / \ 0 0 1 
0 1 0\ /I 0 0 

-1 0 0) 0 0 1 

0 0 l /\0 1 0 



( 1 ) 


( 2 ) 


illustrate (1) that the quarter-turn about the z-axis, represented by the 
matrix 

0 1 0^ 

-10 0 

0 0 ly 

followed by the reflection in the plane y = z 5 represented by 
'1 0 0\ 

0 0 1 


,0 1 o y 
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combine to give a transformation represented by 


i 01 

0 0 

\-l 0 



(2) that if these operations are commuted, a different transformation 
results. (In both cases, the composition transformation is not a ‘simple’ 
one, being known as a rotatory reflection or rotatory inversion (see Coxeter, H., 
Introduction to geometry, Wiley, 1961, p. 99.) 

On the other hand, the scalar matrix 

(* 0 0 \ 

0 k 0 

0 0k 


does commute with other 3x3 matrices; so that, for example, when the 
quarter-turn 



is combined with the enlargement 


( 3 0 O' 

0 3 0 

0 0 3, 


the result is the same when the order of operations is reversed, being repre¬ 
sented by 


0 O' 
0 3 
-3 0 


in each case. 


Suppose next we consider the transformations represented by the matrix 
product 


( 0 0 
0 , 
-1 0 




Each matrix on the L.H.S. represents a quarter-turn, but it is not obvious 
what the product represents. We consider its effect on the general vector 
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and ask under what circumstances such a vector would be unmoved by the 
transformation. This would require x = y, y — — z, z = — x , i.e., any 

1 *\ i \ 

vector of the form I k I, or k I 1 I, would be invariant under this 

transformation — another example of an eigenvector. We suspect, for 
this reason, that it is a rotation. Surprisingly, the rotation is through 120°, 

/ n. i 'i 

for if the matrix is applied again to I — z I it becomes I x |, and after 

the effect of the same two quarter-turns when they are (a) reversed in sense 
(one at a time), (b) commuted, (c) both. Finally, observe that, 



the transformation may be analysed into a 120° turn about the line 
x = y = z, represented by 

^0 1 0 \ 

0 0 1 
M 0 0/ 

followed by a half-turn about the x-axis represented by 

T 0 0\ 

-1 0 
o — IV 


c = 


D = 


Exercise 13.3b 

, If A = (°- l \ B =(- 1 ° 

\i 0) V 0 1 

evaluate the products AB, BA, AC, CA, etc., also C 2 , D 2 , ABC, CBA, 
BAD, etc. and interpret geometrically in each case. 


2 O' 

0 2 )' 


1 1 
0 1 


2 Repeat the above question using the matrices 
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3 If R, = 


cos a sin a 


and Ro = 


cos ft sin p\ 

— sin a cos a J " \ —sin ft cos ft j 

find an d 5 an< I deduce certain trigonometrical formulae. 


4 Decompose the following as the products of two matrices representing 
‘simple 5 two-dimensional transformations such as rotations, reflections, 
enlargements, etc.: 


(0 

O) 

(vii) 


-3 0 

0 -3. 

5 -12 
12 5 

1 1 

-1 3 


(«) 

(y) 


0 -3 
3 0 
2 -3 
0 2 


(m) 

(vi) 


1 V3V 
V3 
0 -2 
3 0 


5 Repeat the above question with the 3x3 matrices: 
0 0-2 






-23 14, 


6 Show that if M = \ 29 
\33 

' 1 \ / 1 \ 

then M | 2 | = | 2 | and M 3 = I. 

VI/ \l/ 


What do you suspect? 
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7 Find vectors (‘eigenvectors’) whose directions are unaltered by 

1 1 - 2 °\ 

N = 1 - 10 . 

\-l 0 1/ 

Show that N 4 = /, but that N does not represent a quarter-turn. 


8 Experiment with the composition of rotations of period 2 (half-turns), 
3 and 4 about various axes as was done in the last paragraph of the text, 
and also with reflections in various planes. 

(A matrix M is said to have ‘period w’ if M r = I when r = n, but 
M r ^ I when r < n.) 


9 Construct 2x2 and 3x3 matrices with period 2, 3, 4, 5, ... . 

10 If a 3 x 3 matrix has the property M 2 = I and MX = X , show that 
it may well not represent a half-turn about the vector X. What conditions 
are needed in addition to the two given in order that it may represent a 
half-turn ? 


11 M is a certain 3x3 matrix whose elements are 0 , 1 or — 1 , and each 
row and each column of M contains exactly one non-zero element. Prove 
that M 2 , M 3 ,..., M n are all of the same form, and deduce that M h — I 
for some positive integer h ^ 48. Interpret the action of M on a vector 
(x, z) geometrically. (m.e.i.) 


12 If the matrix of a plane translation is 


/ l 

0 

fl\ 

l l 

0 

an\ 

0 

1 

b 

|, show that 7” = j 0 

1 

bn I 

\o 

0 

1 

\o 

0 

1 / 


Investigate, by using 3x3 matrices the composition of translations with 
other types of plane transformations. 


13 Interpret the plane transformations 



cos 9 — sin 9 0 

sin 9 cos 9 0 

0 0 1 



and expand the product of the three 3 x 


3 matrices. 


13.4 Inverse transformations and matrices 

So far we have discussed the effect of a particular transformation on a given 
vector: ‘What becomes of it?’ We now ask the reverse question: ‘Under 
such and such a transformation, from where does a given vector originate? ’ 
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Or, in matrix notation, 

if x = Mx , can we find a matrix iVsuch that x = Nx '? 

If so, it will clearly follow that 

x = Nx' = NMx 
and = M* = MAfa'. 

So, if these are true for all vectors x, x', then MN = NM = /f 

and our problem is to find N which is inverse of M (and which we shall 
write M “ 1 ). 


Two dimensional case 


It will be recalled (from 0 . 8 ) that in the two-dimensional case the trans¬ 
formation 




had such an inverse, provided that its determinant A = ad — be was 
non-zero. 


,f m -C \ 

so that M~ 1 M = 


then M 1 = 
1 


1 


ad — be 
d —b\ (a b 


d -b 
-c a 


ad — be \—c aj \e d 
1 fad — be 0 
ad — be \ 0 ad — be 


1 0 
0 1 


= / (and similarly MM 1 = /). 


If, however, A = 0, there was no such inverse, and the transformation 
together with its matrix M were said to be singular. 



a l b x 


Three-dimensional case 
Let us now consider the transformation 
x' = a x x + b^y + c 1 z > 
y' = a 2 x + b 2 y + c 2 z > or 
z’ = a 3 x + b 3 y -f c 3 z J 
f We shall use I to denote the identity (or unit) matrix of any order. So here, for instance, 

/i 0 0\ 

/ = I 0 1 0 1; but in the two-dimensional case which follows, / = ( 1. 

\0 0 1 / 
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f x \ 

Our problem is to find the vector I y I from which the vector I y’ I originated. 


We start by asking the simpler question of where the vector | 0 | originated, 

vOy 

i.e., we try to find | y | such that 

a x x + b x y + c x z =1 ( 1 ) 

a 2 x + b 2 y + c 2 z = 0 ( 2 ) 

a 3 x + b 3 y + c 3 z = 0. (3) 

Looking at the last two of these, we see that they are requiring us to find 
/x\ fa 2 \ ta 3 

a vector I y | which is perpendicular to both I b 2 1 and I b 3 


Now we know (see 11 . 5 ) that their vector product is precisely such a vector, 
and this we abbreviate by writing the vector product thus: 



Similarly, we write 
^ b 3 c 1 b x c 3 \ l A<y 


c^a x - \ = \ B 2 \ and 


v 3 u l L 1 u 3 

\a 3 b x - a x b 3 ) 


(b x c 2 


\^\b 2 a 2 b x 



(A 9 \ 

H 

*3 

\cj 


We now consider the matrix Af* formed from these capital letters, with the 
A's forming the first row: 

( Ai A 2 M 

M* = ( 5, B 2 B 3 1. 


\C\ c 2 c 3 
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This is called the adjoint matrix , and we now consider the product 


MM* = 




( a 1 A 1 b 1 B 1 
a 2-^41 + b 2 B \ d - ^2^1 
a 3 A x + b 3 B x +c 3 C x 


< 21 ^4 2 "4” b ±B 2 + C 2 


d" ^1^3 d" ^1^3 


^ 2-^42 d~ b 2 B 2 A- c 2 C 2 
a 3 A 2 4" b 3 B 2 4“ £ 3^2 


# 2^3 d - b 2 B 3 -b c 2 C 3 I. 
a 3 A 3 + b 3 B 3 + r 3 C 3 / 



is, by definition, perpendicular to [ b 2 | and I b 3 


So a 2 A 1 + b 2 B x + c 2 C x — §&n<\a 3 A x + b 3 B x 4- c 3 C x = 0 (as may be 
verified by direct substitution). Indeed, the only terms in MM* which 
do not vanish are those on the leading diagonal: 

a x A x -j- b X B j 4 - c x C x , ^2^2 d - ^2^2 d - £2^2? ^ 3^3 d“ b 3 B 3 -\- c 3 C 3 . 


But each of these is quickly seen to be equal to the expression 

A — ^ 1 ^ 2^3 ^ 3^2 d - ^ 1 ^ 2^3 ^ 1 ^3^2 d - ^1^2^3 ^ 1 ^ 3 ^ 2 ? 

which we have already called the determinant of M. 


/A 0 0\ 

So MM* — I 0 A 0 I = A/, where A = det (M) 
\0 0 A/ 


and it can similarly be proved that 
M*M = A/. 


Now if A ^ 0, it follows that 


M* 


M 


M—- = /, 

A 


so that M*/A is an inverse of M for both pre- and post-multiplication and 
we can write 

M* 

M 1 — (provided A 7 ^ 0). 


It is immediately clear that M 1 is unique, for if we suppose that L is 
another inverse for pre-multiplication (or left inverse) such that LM = /, 
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then M~ l = IM 1 = (LM)M~ l = L(MM ~ X )t = LI = L 
and Z, is identical with M ~ 1 . 

Similarly, M _ 1 is the unique inverse for post-multiplication. 

If, however, A = 0, then M has no such inverse and is said to be singular. 

Example 1 

Find the adjoints and the inverses of the matrices: 

( 1 0 
0 0 2 
0 -1 0 ) 



Z 1 

0 

°\ 

2 

0 

°\ 

p 

0 

2 

we see that A* = 1 0 

0 

- 2 

\0 

-1 

0 / 

\o 

1 

0/ 


AA* = 


A- 1 =±A*= 0 0 



(ii) Since B = 


BB* = 


Hence A = 0, so that B is singular and does not possess an inverse. 
Example 2 

If A and B are both non-singular matrices, show that: 

(i) (A- 1 )' 1 =A; (ii) (.AB ) _1 =B 1 A 1 . 

(i ) The inverse of A is A ~~ 1 , 
so AA~ l = A~- X A = I. 


| Using the associativity property of matrix multiplication, (see Exercise 13.3a). 
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Hence the inverse of A ” 1 is A 
and = A. 

(«) (AB)(B- 1 A~ 1 ) =A(BB~ 1 )A l = AIA ~ 1 = AA~' = I 
and {B~ l A~ l ) ( AB) = B \A l A)B = B l IB = B~ X B = I. 
So the inverse of AB is B'A 1 
or (AB) -1 = 


Example 3 

Given the transformation 

x' = x + 2y -f- 3z 
y' = 3x + y + 2z 
z' = 2x + 3y + z, 

find: ( 1 ) its inverse transformation; 

(ii) the point from which (1, 2, 3) arises under the given 
transformation. 


The matrix of the original transformation is 




and we can easily find its adjoint matrix 



Furthermore, it is seen that 


AA* = 



/18 0 0 \ 

0 18 0 1 = 187. 

\ 0 0 18/ 


So the determinant A of the transformation is 18, the transformation is 
non-singular and 
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Hence the inverse transformation is 

* = Ts(- 5 *' + V + z ') 

y = Ta(*' - V + 7z') 
z = 18(7*' + y' - 5z'). 

Finally, putting x' = 1, y' = 2, z' = 3 we see that the point (1, 2, 3) 
arises from * = §, y = §, z = — , i.e., from the point (§, f, —j). 

Example 4 

Solve the equations 

«i* + b x y + CjZ = d x 
a 2 x + b 2 y + c 2 z = d 2 
a 3 x + + c 3 z = d 3 . 

These may be written as 



Case (i ) If M is non-singular, then it possesses an inverse M 1 and 

Mx = d 

=> M~ 1 (Mx) = M~ 1 d 

=> (M~ l M)x = M~ l d 

=> lx = M~ 1 d 

=> x = M l d. 

In a particular case, with numerical values for all the a’s b’s and r’s, we 
could find the matrix M~ l and hence solve the equations. But this is a 
tedious process and, though we shall investigate it further (in 13 . 6 ), we 
shall also (in 13 . 7 ) learn how more practical methods can be employed. 

Case (ii) If M is singular, the above method breaks down as M~ x does 
not exist. But we shall see (in 13 . 6 ) that a useful result can still be obtained 
simply by using the adjoint matrix Af* rather than M*/ A. 

Example 5 

Given l\ + m\ = l\ + m\ — 1 and lj 2 + m i m 2 = 0 ; 

prove that l\ + l\ — m\ + m\ = 1 and l 1 m 1 + l 2 m 2 = 0 . 

First of all the reader should attempt to prove this by ordinary manipula¬ 
tion of the equations. Even if successful, it will be agreed that the task is 
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vexing. But how much simpler it is to use matrices. For if we let 

A = ( ll m ‘) and B = ( h 4 
\l 2 m 2 ] \m l m 2 

we immediately see that 

f i\ + m\ lj 2 + m l m 2 
\l\l 2 4 m l m 2 l\ + m\ 

' 1 0 
^ 1 

Hence A and B are inverses, so that 


AB = m A 


lo 


m< 


= I. 


BA = 


h h \(h m \ 


m 1 m 2 J \L 


nic 


l\ 4- l\ 


l 1 m l + l 2 mr 


l 1 m 1 + l 2 m 2 m\ + m 


1 0 
0 1 


=> l\ + l 2 = m\ + m\ — 1 and l 1 m 1 4 - l 2 m 2 = 0 . 

Should it be protested that the saving in work is not very great, consider 
an extension of the given problem into three dimensions: 

Given the equations 


/2 

L \ 

4- 

m\ 

4 n\ = 1 

(i) 

l 2 1 3 

4 

m 2 m 3 

4 n 2 n 3 = 

0 

( 4 ) 

l 2 

l 2 

4- 

m\ 

4 nl = 1 

( 2 ) 

/ 3 /i 

4 

m 3 m 1 

+ n 3 n 1 = 

0 

(5) 

1 2 

1 3 

4- 

m 2 3 

+ ri\ = 1 

(3) 

hk 

4 

m x m 2 

4 n i n 2 = 

0 

(6) 

prove that 








n 

4 

1 2 

1 2 

+ I 3 = 1 ; 

ml 4 m\ 

+ = 

i; 

n 

1 4 n\ 4 

n\ = 1 

i; 


and + m 2 n 2 4 - m 3 n 3 = n l l 1 + n 2 l 2 + n 3 l 3 = l 1 m l + l 2 m 2 + l 3 m 3 = 0 . 

To attempt this by ‘ordinary algebra’ would be prohibitive. By matrices, 
it is scarcely any more difficult than the 2 by 2 case above. For we may take 



and B to be its ‘transpose’ (i.e., with rows and columns interchanged), 
and the reader will have no difficulty in completing the proof on the lines 
of the 2x2 case. Note that equations (l)-( 6 ) are represented by the single 
matrix equation AB = L 


Exercise 13.4a 


1 Obtain the inverse of 


a b 
c d 
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(i) by letting it be 


P q 

r s 


(ii) by solving the equations ax + by = x f 

cx + dy = y' 

to obtain x and y in terms of x and y f . 

3 -2 

— 2 2 

which is mapped by this matrix into the unit square 


2 Find the inverse of 


and hence find the quadrilateral 

±1 N 
+ 1 , 


3 Show that the inverse of the product ABCD... is ... D l C X B l A \ 
where A, 2 ?, C, D ,. . . are square matrices of the same order. 

4 A 2 x 2 matrix M has the property that 


M 


and M 1,1 = 

idj 


Prove that, provided a , b , d satisfy a certain restriction, then M = VU 1 , 

where u -(i j) and *-(?, I)- 

State the restriction on a , b , r, d. (s.m.p.) 


5 A and B are matrices such that AB = A , BA = B.\tA is non-singular, 
prove that A = B = /. If A is singular, prove 1? is also singular, and that 
(A — B) 2 = 0. (m.e.i.) 


6 Find the adjoint and the inverse of 
the equations 

2x — y + 4z = 1 
4x + 2z = — 1 
3# — 2y + 7z = —3. 

7 Find the adjoint and inverse of M = 


r _i 

4 0 
\ 3 -2 

(-- 


2 I and hence solve 
7/ 


°\ 

1 I and use to find the 

2 / 


column vector which, when transformed by M , becomes 



8 Find the inverse of the triangular matrix 
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9 Find the inverse of 
it is singular. 



b 

a 

c 



and state under what circumstances 


a -f b — 3 

io Solve b + c = 4 by first inverting the matrix 
c + a = 5 




ii Under what conditions does M — M *, where Mis: 



Which types of geometrical transformations may such matrices represent? 


12 Invert the matrix M 


A 5 

matrix M' = I 1 1 

\0 -3 




and also the transposed 


Can you draw any general conclusion about the inverse of the transpose of 
a 3 x 3 matrix? Deduce that the inverse of a symmetric matrix (i.e., one 
for which M' = M) is itself symmetric. 


13 A matrix which satisfies MM' = M'M — /, i.e., whose inverse is its 
transpose, is called an orthogonal matrix. Investigate such matrices ( i ) in the 
2 x 2 case (ii) in the 3x3 case, and give examples of each. 


14 Show that if M is an orthogonal 3x3 matrix, and MX = X\ where 
X , X' are column 3 -vectors, then these vectors have the same length , i.e., 
/x\ / x’\ 


if X = 


and X ' — 


then 


x 2 +y 2 


+ z 2 = *' 2 + y' 2 + z' : 


\z/ \z7 

( cos a cos /? sin a cos /? 
cos a sin sin a sin /? 
sin a — cos a 

How do you account for the result geometrically? 


— sin /A 
cos /? I. 
0 / 


16 If A is a matrix of period n (i.e., A r = I when r = n, but A r ^ I when 
r < «), show that BAB~ X is also of period n , where B is another square 
matrix of the same order as A. 



13-4 INVERSE transformations and matrices 
17 Further practice for finding inverses: 


/ 1 4 -3\ (l 2 3\ 

(0 -2 -5 71 ,(||) lo 1 2 , 

\ 3 3-10/ \0 0 1/ 


4-15 


(Hi) 


11 

-1 



( 6 0 —1\ / 2 -1 0\ [\ a a\ 

3 —3 —2), (v) ( — 1 2 -1 L 00 « 1 « L 

-3 6 3/ \ 0 -1 1/ \fl a 1/ 


(vii) 


(x) 



a A 




where 
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2 x 2 linear equations: three interpretations 

At the beginning of this section, we saw how the inverse of a 2 x 2 matrix 
helped in the solution of two simultaneous equations. Following this, we 
considered briefly the solution of three simultaneous linear equations and 
how the inverse of a 3 x 3 matrix might be found and used. The whole 
question of the solution of linear equations will be dealt with in more detail 
in 13.6 and 13 . 7 , but this is a convenient point at which to make some 
preliminary observations in the case of two equations. 

Consider the equations 

a x x + b x y = c x 
a 2 x + b 2 y = c 2 



If 





these may be written as 



or Mx = c. 


( 2 ) 



they can be written as 



x 


or xa + yb = c. 


(3) 
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Each of these alternatives carries a different interpretation. The two 
separate equations ( 1 ) suggest two straight lines in the x, y plane, whose 
intersection one is trying to find. Equation (2) suggests that one is looking 


for an unknown vector x = 



which is transformed by the matrix 


M = 


into the known vector c = ( 1 j: where did this particular 


\ * */ v*/ 

vector originate under the given transformation? The final version (3) 

starts with three given vectors, 



and we ask what multiples of the first two must be combined to give the 
third, or how can we ‘mix 5 the a and b vectors to give the c vector? 

In general , one expects to get a unique solution to each of these questions. 


Thus, in the case 


3x + 2y = 1 | 

Ax — y — — 6 f 


we obtain x = —1,^ = 2. This means 


( 1 ) that the point common to the two lines with given equations has 
coordinates (— 1 , 2 ); 
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More generally, such a unique solution for x, y may always be found by the 
use of the inverse matrix when this exists , (i.e., when A = a l b 2 — a 2 b l ^ 0). 
For then 

/ b 2 “iA/V\ 

\-a 2 aj\c 2 ) 

b 2 c x — b x c 2 ' _ n l c 2 — a 2 c 1 

a^b 2 a 2 b ^ a ^b 2 a 2 b ^ 



2 x 2 linear equations: singular case 

The exceptional case gives two possibilities, illustrated by the examples: 

(z) 3x — 2y = 1 (zz) 3x — 2y = 1 

6 x — \y — — 1 ; 6 v — 4z/ = 2 . 


Equations (z) have no finite solutions in x, y. The first of our three inter¬ 
pretations shows this by the fact that the lines are parallel, and so fail to 
intersect. The second interpretation is that the transformation represented 


by the matrix 


6 -4 


is singular, and the effect on all vectors in the x, y 


plane is to take them into vectors having the direction 



, for 



/3x - 2z/\ 
\6x — 4y J 


= (3x - 2y) 



No conceivable values of x, y can cause this vector to be 


which 


has a different direction. The third interpretation is that we cannot possibly 
take a certain multiple of a = ( ^ and a certain multiple of b 


and combine them to obtain c = 


, simply because a and b have 


the same direction, i.e., lie in the same one-dimensional space (line), 
whereas c is outside that space. 

Equations (zz), on the other hand, have an infinity of solutions for x, y. 
The lines, taking the first interpretation, are coincident, and so have an 
infinity of points in common. (One of the equations, you may prefer to 
think, is superfluous.) You should think out the second and third interpre¬ 
tations in case (zz) as well. 

Note that the exceptional cases in the solution of Mx = c arise precisely 
in those cases when the equations 


a x x + b x y = 0 
a 2 x + b 2 y = 0 


or Mx = 0 
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do possess solutions other than x = 0. For if A # 0, the only possible 
solution to the homogeneous equations 


a x x + b x y — 0 
a 2 x + b 2 y = 0 


is x = 0 , y — 0 . 


Exercise 13.4b 

1 Give the three interpretations suggested in the text in the case of the 
equations: 

(i) x + 2 y = 2 (ii) bx — 6 # = lOj 

3x — y — 13; — 3x 4- \y = — 6 ^; 

and draw diagrams to illustrate. 


2 Apply the three interpretations to the equations 

T~/~‘ in the cases = ( f) and (’ 

6 x — 3y = b \b J y — 3 J \b 

and give a parametric representation of the infinity of solutions in the 
latter case. 


-1 

-3 


3 What multiples of the vectors 


give the vector 




^ must be combined to 


4 Prove that the equations a x x + b x y = 0, a 2 x 4 - b 2 y = 0 have an in¬ 
finity of solutions if and only if a x b 2 — a 2 b l = 0. Prove however, that in 
this case, the equations a x x + b x y = c x and a 2 x + b 2 y = c 2 do not in 
general have a solution, and find under what conditions they do. 

5 Show that the system of equations 
5x — 6y 4- 3 = 0 

— 3x + \y — 2 = 0 

has a unique solution even though the matrix 
two columns linearly dependent. 


-6 

4 


has the final 


Codes 

Having established the idea of an inverse, we are now in a position to 
consider a possible application of matrices to the coding and decoding of 
messages. One obvious way of constructing a code is to use a number to 
stand for each letter of the alphabet, thus: 
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A 

B 

c 

D 

E 

F 

G 

H 

I 

J 

K 

L 

M 

Row 1 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

Row 2 

3 

6 

9 

12 

15 

18 

21 

24 

1 

4 

7 

10 

13 


N 

O 

P 

Q 

R 

S 

T 

U 

V 

W 

X 

Y 

Z 

Row 1 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

Row 2 

16 

19 

22 

25 

2 

5 

8 

11 

14 

17 

20 

23 

26 


Row 1 provides too transparent a method. Row 2 is an improvement. It 
might have been a random permutation of the numbers 1, 2, 3, ..., 25, 26, 
but it will be seen that there is a ‘system’, whereby each number in row 1 
has been multiplied by 3 and reduced modulo 26 where necessary, e.g., 
21 x 3 = 63 =11 (mod. 26). The idea of this system is that it makes it 
easy to decode a message without having to keep referring to the crib. For, 
as coding is performed by multiplication of row 1 by 3, so decoding is carried 
out by division by 3, e.g., 11 -r- 3 = 63 -r 3 (mod. 26) = 21, giving U. 
We may write: 

3 x (WORD) -> CODED WORD 

3 -1 x (CODED WORD) -> WORD 

This method has the obvious disadvantage that it may be easily broken 
down on account of each letter of the alphabet being invariably represented 
by the same number — it is a simple ‘replacement’ code. It also suffers 
from the inconvenience of having to reduce modulo 26, a rather awkward 
number. The latter objection may be removed by adding four characters 
as shown: 



5 

? 

□ (space) 

27 

28 

29 

0 


Then it becomes necessary to discard multiples of 30, which is easier. To 
remove the former objection, we abandon row 2,f and re-arrange the 
message, say: ‘COME QUICKLY TO MY ASSISTANCE’ as follows: 

CMDUCLnOMDS I T N E □ 

OEQI KYTQYAS SAC . □ 

(□ means ‘space’). Coding by row l,f we set up a two-row matrix: 


_ / 3 13 0 21 3 12 0 15 13 0 19 9 20 14 5 0\ 
W ~ (l5 5 17 9 11 25 20 0 25 1 19 19 1 3 27 0/ 

which is then to be ‘coded’ by a coding matrix , 


f A random permutation of the set 0, 1, 2, . . ., 29 would be marginally better. 
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c = 



pre-multiplying as follows: 


/3 1\ / 3 13 0 21 3 12 0 15 13 0 19 9 20 14 5 0\ 

\5 2/ \15 5 17 9 11 25 20 0 25 1 19 19 1 3 27 0/ 

/24 44 17 72 20 61 20 45 64 1 76 46 61 45 42 0\ 
\45 75 34 123 37 110 40 75 115 2 133 83 102 76 79 0/ 
/24 14 17 12 20 1 20 15 41 16 16 1 15 12 0\ 

\15 15 4 3 7 20 10 15 25 2 13 23 12 16 19 0/ 

(mod. 30), 


giving the coded message: 

XONOQDLCTGATTJOODYABPMPWALOLS. 


In order to decode, we need the decoder matrix, Z), which will take us from 
CW to W: D(CW) = W. Thus DC = /, so that D (as expected) should 

be the inverse of C, i.e., D = C~ 1 = 

Going in the reverse direction then, we have 

/ 2 —1\ /24 14 17 12 20 1 20 15 4 1 16 16 1 15 12 0\ 

\ —5 3 J \\5 15 4 3 7 20 10 15 25 2 13 23 12 16 19 0/ 

33 13 30 21 33 -18 30 15 -17 0 19 

-75 -25 -73 -51 -79 55 -70 -30 55 1 -41 

9 -10 14 5 0\ 

-11 31 -27 -3 0/ 

3 13 0 21 3 12 0 15 13 0 19 9 20 14 5 0\ 

15 5 17 9 11 25 20 0 25 1 19 19 1 3 27 0/ (m ° * ^ 





and the decoded message can then be read off. Notice that both the word 
spacing and the repetition of letters are completely obscured by this 
method: the three Ss in ‘ assistance’ come out as P, M, and W respectively. 

Of course, the coder C may be chosen in many ways, but in order that 
the decoder D may contain integers only, we must arrange for the determin¬ 
ant of C (= ac — bd) to be + 1. In the above case, |C| = 3 x 2 — 1 x 5 = 1. 

For example, 

if C= ^ Jj, \C\ = -1, and D = C 1 = 

The presence of a zero makes computation easy: 

/2 1\ / 3 13 0 21 .. A _ /21 1 17 

\\ 0/\15 5 17 9 .../ \ 3 13 0 




13 4 INVERSE transformations and matrices 423 

and the fact that the second row of C, (1, 0) simply moves the top row of W 
to the bottom scarcely gives the game away; for the zero moving to the 
bottom row causes the first ‘ word 5 of the coded message to have 5 letters 
instead of 4. 

Evidently the method could be made more subtle by using a 3 x 3 
coder, the message being written out in three rows. The computation might 
be heavy, but use of a computer would avoid the tedium. One of the diffi¬ 
culties would be to find a 3 x 3 matrix with integral coefficients whose 
inverse has the same property. However, you might try with the following 
coder: 

2 1 3\ / 2 

1 0 — 1 1, the corresponding decoder being D= I — 9 

4 2 5/ \ 2 

(first checking for yourself that CD = I). 




Exercise 13.4c 

1 Using row (1) at the beginning of this section (A = 1, B 
with . , ? □ represented by 27, 28, 29, 30, and the encoder 



and decoder 



in the text, 


2, etc., 


construct and decode various messages. (This should be done in pairs, each 
pupil constructing a message and exchanging with his partner.) 


2 Find other suitable encoders and decoders, avoiding fractions by 
arranging for ad — be = + 1. 


3 Code the message, ‘Come quickly to my assistance’, using 



as encoder, and 



as decoder, 


with the modulo 30 system established in the text. Repeat with the message 
‘The quick brown dog jumps over the lazy fox.’ 


4 Using row (2) on p. 421 (A = 3, B = 6 ,..., Z = 26) . = 27, , = 28, 
? = 29, □ = 30), the word ADAM was coded as CDNE. What was the 
encoding matrix? Using this matrix, how would the message ‘BEEF 
ESSENCE’be coded? 


13.5 Determinants 

We have already met instances of both 2x2 and 3x3 determinants. 
Unlike a matrix, a determinant is not just a table of numerical data, but 
(quite differently) a shorthand way of writing an algebraic expression, 
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whose value can be computed when the values of the 4 terms 5 or 4 elements 5 
are known. 

There is little to be said about 2x2 determinants: 


a b 
c d 


ad — be 


and this method of writing such a two-termed expression does not carry 
many advantages. However, there is some point in doing a little work on 
them in the following exercise since the rules for manipulating 2x2 
determinants are imitated by those for larger determinants. For example, 


if 


A = 


a b 
c d 


then 


ak 

ck 


= kA , and 


ak bk\ 
ck dk\ 


= k 2 A 


(not kA, as with matrices). 
So with 3x3 determinants, 



a i 


C 1 


a x k 

b x k c x k 

if A = 

a 2 

b 2 

C 2 

, then 

a 2 

b 2 c 2 


a 3 

b 3 

C 3 


a 3 

^3 C 3 


= kA: 


while if every element were multiplied by k , the value would be k 3 A. 


Exercise 13.5a 

1 Evaluate: 


{iv) 


(vii) 


3 -4 
-2 1 ’ 

173 163 

94 104 ’ 

300 -400 
-240 320 


(») 


8 -3 

2 6 


■42 6 

-36 48 


(Hi) 


(vi) 


8 5 

80 50 ’ 

13 1 
1 001 17\ 


2 Show that 


a b 
a 2 b 2 


has a factor (a 


b ). Generalise the result. 


3 Simplify: 

cos 9 sin 9 
— sin 9 cos 9 


a + b a — b 
a — b a + b 


( in) 


cos 9 
sin 9 


sin 9 
cos 9 5 


cosh u sinh u 
sinh u cosh u ’ 


(v) 


21 1 - t 2 

1 - t 2 21 


(vi) 


px + qz 

py + <7 z 


rx + jz 
ry + sz 
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If A = 


a b 
c d\ 


show that 


and 


cl + c b + d 
c d 

both have the value A, and write down some other determinants which 
have the same value. 


a b + ka 
c d *4“ kc 


5 Express 


a 4 x b 4- y\ 
c 4 p d 4 q\ 


as the sum of determinants. 


6 Find the value of X so that the equations 

2 x 4 5y = Xx 
3x 4 4 y = Xy 

may have non-trivial solution. 


7 Solve the equation for x: 


2 - x 
-6 


1 


= 0. 


3 4 x\ 

8 Show that, in general, the solutions of the simultaneous equations 

x y 1 

K ~\ = 17. 

may be written 


a x x 4 b x y 4- c x — 0 
a 2 x + b 2 y 4 c 2 — 0 


b. 

Cl 

1 

*1 


ai 

*i 

b 2 

c 2 

\c 2 

CM 


a 2 

CM 

-C5 


Use this: 

(i) to solve 3 x — y = 


-8, x 4 ly = 1; 


(ii) to eliminate x from the equations 


ax 2 4 bx 4 c = 0 
px 2 4 qx 4 r = 0. 


3 x 3 determinants 

We now consider the 3 straight lines with equations: 

a x x 4 b x y 4 c x = 0 
a 2 x 4 b 2 y 4 c 2 = 0 
a 3 x 4 b 3 y -F c 3 = 0 

and the condition for them to be concurrent. Solving (2) and (3), gives (in the 
case of non-zero denominators): 

x u 1 


( 1 ) 

( 2 ) 

(3) 


b 2 c 3 b 3 c 2 


c 2 a 3 c 3 a 2 


a 2 b 3 


a 3 b 2 


the point of intersection of lines (2) and (3). If this point is to lie also on the 
first line, its coordinates must satisfy (1), and so by substitution we obtain 


a i [b 2 c 3 ^3^2) ^ 1 (^c 2 a 3 c 3 a 2 } c 1 {ct 2 b 3 


a 3 b Q ) — 0 


(4) 


The expression on the left-hand side is precisely what we have already 
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called the determinant of the coefficients in the three equations, written 



b x c x 
b 2 c 2 

^3 C 3 


Such expressions occur so frequently in mathematics that the determinant 
shorthand notation was devised for them. Just as expressions like 
xxxxxxxxx have the abbreviation # 5 , and with this abbreviated 
notation comes manipulation using the laws of indices , so with determinants 
we shall want to devise corresponding methods of manipulation. 

Another instance of the occurrence of 3 x 3 determinants in coordinate 
geometry is in the area of a triangle formed by 3 points (x l , yf, (x 2 , y 2 ), 
( x 3 , y 3 ) which is given by the expression 


A = i(*!0 2 - *201 + *203 - *302 + *301 ~ *10 3 ) = 


01 
y 2 
y 3 


l 

i 

l 


The condition for collinearity of 3 points is of course that this determinant 
should vanish. 

In this section we shall deal principally with 3x3 determinants! and 
then indicate briefly how their behaviour is imitated in the case of deter¬ 
minants of higher order. 


Expansion of determinants: minors and cofactors 


The expansion of the general 3x3 determinant has already been given: 



a\b ( 2p<$ a i b 3 c 2 b \C 2 a 3 b\£ 3 a 2 4~ ^i^2^3 c^a 3 b 2 . 


^3 ^3 


Each term is the product of 3 elements of the determinant, no two in the 
same row or column (i.e., there are no terms like b l a 2 b 3 , or af^cf), so it is 
easily seen that the number of terms must be 3 x 2 x 1 = 6. In the 
case of 4 x 4 determinants, the number of terms (such as a 2 b l c 4 d 3 ) would 
be 4 X 3 x 2 x 1=41 = 24, and in the general case of a n x n deter¬ 
minant, there will be n \ terms in the expansion. Thus the notation gains in 
brevity as the order of A increases: a 5 x 5 determinant is a very efficient 
abbreviation for an algebraic expression with 120 terms. 

In dealing with determinants of higher order, one needs to know the 
rule for deciding which of the n\ terms carry a T and which a — sign. 


f Determinants must be square: there is no such thing as a rectangular determinant. 
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This rule is bound up with the theory of odd and even permutations,! and 

we do not need to concern ourselves with it here. 

The value of the 3 X 3 determinant may be written in many alternative 
ways, each of which we abbreviate using the notation of 13.4 : 

a x (b 2 c 3 — b 3 c 2 ) + b\{c 2 a 3 — c 3 a 2 ) + c 1 (a 2 b 3 — a 3 b 2 ) 

= a x A x + b 1 B 1 + c x C x ( 1 ) 

a 2 (b 3 c x — b x c 3 ) + b 2 (c 3 a x — c x a 3 ) + c 2 (a 3 b x — a x b 3 ) 

ci 2 A 2 b 2 B 2 ~ I - c 2 C 2 ( 2 ) 

d 3 {b\C 2 ^2^1) "f b 3 {c \&2 ^2^1) ^31^2 ^2^1) 

= a 3 A 3 + b 3 B 3 + c 3 C 3 ( 3 ) 

a x {b 2 c 3 b 3 c 2 ) — I - ^2(^3^! b 1^3) — I - ^3^b x c 2 ^2^1) 

= u x A x 4 - a 2 A 2 + <23^3 ( 4 ) 

b x (c 2 a 3 £3^2) d~ b 2 {c 3 d x ^1^3) "b b 3 f x d 2 ^2^1) 

= b x B x + b 2 B 2 + b 3 B 3 (5) 

c 1 (a 2 b 3 — d 3 b 2 ) + c 2 (a 3 b x — a x b 3 ) + c 3 (d x b 2 — a 2 b x ) 

= c x C x + c 2 C 2 + c 3 C 3 ( 6 ) 


(1) is called the ‘expansion of the determinant by the first row': the 
elements a x , b x , c x of the first row are multiplied by their corresponding 
cofactors A x , B x , C x to form the inner product a 1 A 1 -f b x B x + c x C x . (2) and 

(3) are expansions by the second and third rows respectively, while in 

(4) , (5), (6) we have written the expansion by the first, second and third 
columns. 

We now need to recognise at sight the cofactors of the several terms of the 
determinant 




a \ b x c x 
-# 2 — b -^— 
d 3 b 3 1‘ 3 

-C 2 








+ 

- 

+ 



- 

+ 

- 


_J 

+ 

_1 


+ 

_1 



It will be seen from the examples given above that each cofactor is either 
plus or minus the 2x2 determinant obtained by striking out the row and 
column through the element concerned, the appropriate sign being given 
by the chessboard scheme on the right. For example, if we want the co¬ 
factor of c 2 , we strike out the second row and third column, i.e., those which 
intersect in the element c 2 as in the third part of the diagram, thereby 


obtaining the determinant 


which is known as the minor of c 2 . 


f SeeBudden, F. J., The Fascination of Groups, C.U.P., 1972, p. 312 ; Moakes, A. J. and Neill, H., 
Vectors , Matrices and Linear Equations , Oliver and Boyd, 1967. 
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The scheme of signs shows that C 2 has a value which is the negative of this 
minor, i.e., 

C'2 ~ ~ a fo = ( a 1^3 — a 3^l) = a 3^l ~ a i^3 * 

The rule is true for determinants of any order (and enables the general 
determinant to be defined recursively): 



In theoretical work we prefer to work with cofactors rather than minors, 
since the expansions as in (1 )... (6) do not contain any — signs, the co¬ 
factors themselves incorporating the appropriate signs. The latter are 
therefore sometimes referred to as c rectified 5 minors. 


Exercise 13.5b 

1 Evaluate the following determinants and check your answers by ex¬ 
panding them in several different ways: 

6 3-7 13-1 -13 1 

(0 5 0 2; (ii) 23 1 ; (Hi) -6 12 3 

2-14 334 610 

1-2-3 4 

60 18 49 

-2 3 4-5 

(iv) 50 0 -14 ; (v) 

3 -4 -5 6 

20 -6 -28 

-4 5 6-7 
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2 Expand the determinants 

a b c a c b a b c 

cab , b a c , b c a 

b c a c b a cab 

and deduce the values of 
a b 0 1 x 2 x 

b 0 a and x 1 x 2 

0 a b x 2 x 1 

3 Evaluate the cofactors of some of the elements in the above deter¬ 
minants. 

a h g 

4 Write down the cofactors in h b f beginnings! = be —f 2 . 

g f c 

Show that hA + bH + fG = 0, and obtain similar results. 

5 Solve the equation for x : 

2 — x —3 5 

0 5 - x 2 = 0. 

8 —6 8 — x 

2 -3 1 

6 If A = 1 4 6 , show that \A\ = 0, 

0 -1 -1 

2 -4 -22 

and that |adj. A\ 1 —2 —11 
-1 2 11 

What do you notice? (Compare exercise 13.5 d. no. 8 ). 

7 Expand the determinants: 

a a 0 1 1 1 be 2 + b 2 c be 1 

(i) 0 b b ; (ii) x y z ; (Hi) ca 2 + c 2 a ca 1 

c 0 c y + zz + xx + y ab 2 + a 2 b ab 1 
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8 Eliminate x, y , and z from: 

( i ) x + ay — bz = 0 (ii) a{y — z) — x 

— ax + y + cz — 0; b(z — x) = y. 

bx — cy + z = Q c(x — y) = z 

9 What is the volume of a parallelepiped, three adjacent edges of which 
are the vectors OP r , where P r is (x r9 y r9 z r ), r = 1, 2, 3. Find the volume 
of a tetrahedron having vertices O (0, 0, 0), Pj ( — 2, 1, 0), P 2 (3, 1, 1), 
P 3 (1,0, -4). 


io Show that the result of eliminating x from the equations 

I a b c 01 


ax 2 + bx + c = 0 

can be expressed 

px 2 + qx + r — 0 


0 a b 

P q r 


c 

0 


= 0 . 


I 0 p q r | 

(Compare exercise 13 .5a, no. 8). 

11 Eliminate x from the equations 

ax 2 + bx + c = 0 and px 3 + qx 2 -f rx + s = 0, 

giving the result as a 5 x 5 determinant equated to zero. 


Row and column operations 

Before dealing with the rules for the manipulation of determinants, we 
shall introduce a notation which will be found convenient throughout the 
remainder of this chapter for both matrices and determinants. 

We may regard a matrix as a collection of row vectors which we shall 
call r l5 r 2 , r 3 , etc., and similarly the columns will be denoted c l9 c 2 , c 3 , 



0 3-1 

H\ 

So, in the determinant 

2 5 1 

-10 2 

, Co denotes the vector 1 1 1 

w 


Furthermore, we shall use a notation similar to that of transformations 
to indicate operations of replacing individual rows or columns. For 
example, r' 2 = r 2 — 2r x simply means ‘create a new second row by 
subtracting from its elements twice the corresponding elements of the first 
row’; so that the above determinant becomes 


0 

3 

-1 


0 

3 

-1 

2 - 2x0 

5-2x3 

1 — 2x — 1 

, or 

2 

-1 

3 

-1 

0 

2 


-1 

0 

2 


Similarly, the operation c 2 = 2c 2 has the effect of creating a new second 
column by doubling the elements of its predecessor. 
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It may be noticed that row operations on matrices can themselves be 
performed by the use of matrices. For example, r 2 = r 2 — 2 r 1 can be 
achieved by pre-multiplying the given matrix by 


( 


1 

-2 

0 


0 

1 

0 



Similarly, the operation r\ = ccr 2 + j8r 3 is achieved by pre-multiplying 
by 


/0 a 
0 1 0 , 
\0 0 1 / 


fO a 




tota 2 4- pa 3 &b 2 + fib 3 ctc 2 + jS c 3 


for 


0 1 0 
,0 0 1 




CL 2 
a 3 


b 2 

^3 


r 2 

C 3 


whilst the effect of pre-multiplication by 

/I 0 0 

0 10 
\0 0 a 

would be to carry out the operation r f 3 = ccr 3 . Matrices which perform 
these row operations are known as elementary matrices and we shall use them 
extensively in the solution of systems of linear equations ( 13 . 7 ). 

Lastly, the reader might like to investigate how column operations can 
be achieved by means of post-multiplication by similar elementary matrices. 


Rules for manipulation of determinants 

We shall not go into the tedious process of proving these rules, but will 
simply illustrate them, letting them speak for themselves. 

( i) \M\ = |Af'|, i.e., the determinant of a matrix is unchanged when the matrix is 

transposed. 

^ (the reader should verify that the expansion of 
A' agrees term by term with that of A). 



(ii) Multiple of row (or column ), 


ka x 

K 

c i 



a x 

b, 

^1 

ka 2 

b 2 

c 2 

— kA ; 

A' = 

a 2 

b 2 

C 2 

ka 3 

h 

C 3 



ka 3 

Ms 

fo 3 


e.g., A' = 


= 2A, etc. 
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{Hi) When two rows {or columns) are interchanged , the determinant changes sign , 



— A: this may be verified by actual expansion. 


Corollaries: 

{a) If two rows {or columns ) are identical , the determinant has the value zero (for in 
this case :A = — A => A = 0 ). 

(b ) If two rows {or columns) are proportional , then the determinant is zero , 

The two proportional rows (or columns) are, in fact, linearly depen¬ 
dent (see 13 . 1 ) and the determinant is also zero if its three rows (or 
columns) are linearly dependent, but this cannot usually be spotted at a 
glance. 

{iv) A determinant is unchanged when to the elements of any row {or column) are 
added a constant multiple of the elements of any other row {or column). 

For example, 

a 1 b 1 c j 

if A — ^2 ^2 ^2 

a 3 ^3 c 3 

let r[ = r l + hr 2 . 

a± + Xa 2 T Ac 2 

Then A' = a 2 b 2 c 2 

a 3 ^3 c 3 

a j b j c j a 2 b 2 c 2 

a 2 b 2 c 2 “I - h a 2 b 2 c 2 

a$ ^3 ^3 ^3 ^3 ^3 

= A + X x 0 
= A. 

Finally, it is also worth expressing corollary Hi {a) in terms of cofactors. 
Suppose, for instance, that the second row of A is replaced by its first row 
(i.e., r' = r,). 

a x b x c x 

Then a 2 b 2 c 2 = A is replaced by 

a% ^3 ^3 
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a \ b x c i 

a \ c x = 0 (since two rows are identical). 

a 3 ^3 C 3 

Now the cofactors of the second row have been unaffected by the replace¬ 
ment, so the equation 

^2^2 ^2^2 cffd^ = ^ 

has been replaced by 
d±A2 4- ^ 1^2 “h CjfT? = 

This expression consists of the elements of row 1 of the original determinant 
taken-with the cofactors of a different row (row 2), or alien cofactors as they 
are usually called. 

Similarly a t A 3 + b l B 3 c x C 3 — 0; 

and more generally we may say, calling any such expression an inner 
product , that 

The inner product of the elements of a row (or column) with the 
cofactors of the same row (or column) gives the value of the determinant ; 
but the inner product of the elements of a row (or column) with the 
alien cofactors of a different row (or column) gives zero. 

Example 1 

2 4 18 

Evaluate A = — 3 10 —27 

1 3 11 

We try to use row and column operations to obtain as many zeros as 
possible: 

1 2 9 

A = 2 -3 10 -27 

1 3 11 

1 2 9 

= 2-3 10 -27 (r 3 = r 3 - r x ) 

0 1 2 

1 2 5 

= 2-3 10 -47 (c 3 = c 3 - 2c 2 ) 

0 1 0 
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1 5 

= -2 

-3 -47 

= —2( —47 + 15) 

= 64. 

Alternatively, 

1 2 9 

A = 2 -3 10 -27 

1 3 11 

1 9 

= 32 = 32 x 2 = 64. 

1 11 

Example 2 

1 a a 3 

Factorise 1 b b 3 

1 c c 3 

Putting b = c gives two identical rows, thus causing the determinant to 
vanish. Therefore, by the remainder theorem, (b — c) is a factor. Similarly 
(c — a) and (a — b) are also factors. Hence A = (b — c) (c — a) (a — b)F , 
and consideration of the degree of the determinant enables us to see that 
the remaining factor F must have degree 1. Clearly it is symmetric in 
a , b and c, so must have the form k(a + b + c), where A; is a constant. The 
‘leading term’ lbe 3 (the product of the elements in the leading diagonal) 
has coefficient 1, and this shows us that k = 1. Hence 

A = (b — c) (c — a) (a — b) {a + b + c) 

Example 3 

abed 
d a b c 

Evaluate A = 

c d a b 
b c d a 

b + a + d + c c + b + a + d d + c + b + a 
a b c 

dab 
c d a 

(r; = r, + r 2 + r 3 + r 4 ) 




1 

2 

9 

= 2 

0 

16 

0 


1 

3 

11 


O 2 = r 2 + 3 r t ) 
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1111 

d a b c 

— (a b + c + d) 

c d a b 
b c d a 

It is, in fact, unnecessary to go so far: all we need is to obtain the factor 
(a + b + c + d). Similarly, r[ = r x — r 2 + r 3 — r 4 detects the factor 

(a — d + c — b), and so A = (a + b + c + d) (a — b + c — d)F. The 

remaining factor F, being homogeneous of degree 2, may contain terms in 
a 2 , b 2 , c 2 , d 2 , ab, ac, ad, be, bd, cd. But it is easier to find it by using complex 
numbers. For r[ = r x + ir 2 — r 3 — ir 4 replaces r x by 

(a + id — c — ib b + ia — d — ic c + ib — a — id d + ic — b — ia) 

= (a — ib — c + id i(a — ib — c ~fi id) — (a — ib — c + id) 

— i(a — ib — c + id)), 

showing that (a — ib — c + id) is a factor. Similarly, the conjugate 
(a -h ib — c — id) is also a factor, and we get 

A = (a + b + c -f d) (a — b + c — d) {a -f ib — c — id) {a — ib — c + id). 
Finally, combining the conjugate factors gives 

A = (a + b + c + d) [a — b + c — d) (a 2 + b 2 + c 2 + d 2 — 2 ac — 2bd). 

Exercise 13.5c 

1 Perform the operations specified on the given determinants, and hence 
evaluate: 

4 -18 3 

(0 2 12 -4 

-4 0 5 

1 a b 
(ii) 1 a 2 b 2 
1 a 3 b 3 

a — b — c 2 a 2 a 

2b b — c — a 2b \ 

r 2 

2c 2c c — a — b 

b c 

c a c\ — c! T coc 2 + (jo 2 c 3 . where co 3 = 1. 
a b 


a 

(iv) b 
c 




r\ = r x + r 3 

r 3 = r 3 + 2r 2 ; 
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a b c 

2 Use the cube roots of unity to factorise cab 

b c a 

(r\ = r x + r 2 + r 3 ; compare the text above, and exercise 10 . 4 b, no 9). 

3 Use the result of no. 2 above to write down the factors of 

a —l 0 1 a a 2 

0 a — 1 and of a 2 1 a 

— 1 0 a a a 2 1 

What is the value of the latter determinant when a is a complex cube root 
of unity? 

4 Factorise the following determinants: 

1 be a b + cc-\-aa-\-b 

( i ) 1 ca b ; (ii) be ca ab 

1 ab c a b c 



b + c 

c a 

a 4~ b 


a 

be 

a 2 


1 

a 

b 

+ 

c 

iii) 

c 

a 

b 

; (iv) 

b 

ca 

b 2 

; (*>) 

1 

b 

c 

+ 

a 


1 

1 

1 


c 

ab 



1 

c 

a 

+ 

b 


5 Show that: 

a b c 

a 2 b 2 c 2 = k(a + b + c)(b — c){c — a) (a — b), 

b + c c -\- a a + b 
and find the value of the constant k. 
baa 

6 Evaluate aba 

a a b 

and find other possible values of a 3 x 3 determinant which contains six 
a ?s and three s. 


7 Express in factorised form the determinants: 


b 2 + c 

ab 

ac 


ab 

c 2 + a 2 

be 

; («) 

ca 

cb 

a 2 + b 2 



a 

a 

b 

a 


a 

a 

a 

b 
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1111 abed abed 

1—111 bade b a c d 

1—1—11 c d a b a b d c 

1—1—1—1 d c b a bade 

1 2 4 

8 Evaluate all the cofactors in the determinant 2 4 8 

4 8 16 

Write down several other examples of determinants which have all the 
2x2 minors equal to zero. 

Compose a 4 x 4 determinant which has every one of its 3 x 3 minors zero. 
Multiplication rule for determinants 

Suppose that A and B are two 3 x 3 matrices and that their determinants 
are denoted by \A\ and | 2 ?|. Now we have seen (in 13.3) how to form, by 
row and column multiplication, their product matrix AB , which in turn 
will have a determinant denoted by \AB\ . Is it, we wonder, possible to 
state the value of \AB\ simply from our knowledge of \A\ and \B\ without 
having to find the product AB and then go through the tedious process of 
calculating its determinant? 

A little reflection will show that this is easy. For if the successive trans¬ 
formations 

x " = Ax' x — Bx 
are combined to give 
x" = ABx , 

volumes are multiplied by the factors \A\ and | 2 ?| under the separate trans¬ 
formations, and by \AB\ under the combined transformation . 42 ? (see 13.2). 
But the final effects in both cases must be identical. 

so 

Similarly \BA\ = \B\ x \A\, so AB and BA have the same determinant, 
even though the matrices themselves may well be different. 

A similar proof, using areas, applies in the case of 2 x 2 determinants 
and the result is also generally true for determinants of any order. 

Exercise 13.3d 

1 ( i ) Verify, by expansion of the determinants, that 

a a b <xa + fie aib + fid 

yd c d ya -h 5 c yb + Sd 
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(it) Express the square of D = 
determinant. 


a b 
I c d\ 


in four different ways as a 2 x 2 


(Note that if D' = 
D'D\) 


a c 
b d 


then we may have DD ' 7 D'D , DD and 


2 Find a counter-example to show that, if X 7 F, Z are square matrices, 
such that Z = X + F, then det X + det F ^ det Z. 


3 If D = 
different ways. 

4 Express 


a c 0 
b 0 c 
0 a b 


express D 2 as a 3 x 3 determinant in two 


a 2 + 


b 2 ab + pq ac + pr | 

\ab + pq b 2 + q 2 be + qr 

ac -f pr be + qr c 2 + r 2 

as the square of a determinant, and hence prove that it is zero. 

a 1 b x c 1 

5 Using the notation |a b c\ to denote a 2 b 2 e 2 

^3 ^3 ^3 

where a, 6, c are column vectors, prove that 

\a + x b c\ = \a b c\ + \x b c |, 

and that |a + x+y b c\ = \a b c\ + \x b c\ + \y b c\, 

and express |a + x b -j- y c + z\ as the sum of eight determinants. 

6 Prove the row-by-column rule for multiplication of 3 x 3 determinants 
by expressing the determinant of the product matrix in 13.3 (see p. 399) as 
the sum of 27 determinants, 21 of which may be proved to be zero. 


(see 13.4, p. 410), 



a x 

K 

U 



A 2 

^3 

7 If A = 

a 2 

b 2 

e 2 | 

and A* = 

B , 

b 2 

B 3 

1 

a 3 

^3 

C 3 


^1 

C 2 

C 3 


show that A* = A 2 , and that 


B 2 B 3 

c 2 c 3 


= a x A, 


with similar results for the other minors of A*. 


8 If a 3 x 3 determinant is zero, prove that any pair of rows or of columns 
of the adjoint determinant are linearly dependent, 
j ^4 4 

e.g. that — 1 = —^ assuming B u B 2 , B 3 # 0 

n l n 2 d 3 

(use the result of the previous question). 
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9 Show how to construct 2 x 2 and 3 x 3 matrices with elements which 
are integers so that their inverses may also have elements which are 
integers, (cf. ‘codes’, pp. 420-423). 


* 13.6 Systems of linear equations 


Use of determinants in the solution of linear equations 


For practical purposes, particularly in the case of large numbers of equa¬ 
tions with large numbers of unknowns (as are frequently encountered in 
commercial, industrial and scientific situations), the method of reducing 
to echelon form (see 13.7) is the most convenient. But it is also instructive 
to approach the problem of the solution of a set of linear equations by using 
determinants. 

We shall merely hint at the case of a 2 x 2 system : 


a \ b x 
a 2 ^2 

the solutions may be written 


a x x + b x y = c x 
a 2 x + b 2 y = c 


by noting that, if A = 


Ax = 


c \ 

C 2 


b 1 
b 2 \ 


A y 


a 1 C l 

a 2 c 2 


these being true in the non-regular case (A = 0 ) as well as in the regular 
case. {cf. pp. 417-419). 

Proceeding immediately to the 3 x 3 case, 


a x x + b x y + c x z = d x (1) 

a 2 x + b 2 y + c 2 z = d 2 (2) 

a 3 x + b 3 y + c 3 z = d 3 , (3) 


a possible elementary way of obtaining solutions would be to eliminate z 
from ( 1 ) and ( 2 ) to obtain a new equation ( 4 ) in x and y ; then to eliminate 
z from ( 2 ) and ( 3 ) to obtain a second equation ( 5 ) in x and y\ and finally 
to eliminate y from ( 4 ) and ( 5 ) thereby enabling x to be found. How much 
simpler if this multi-step solution could be telescoped into a single step! 
In fact, this can be done by multiplying ( 1 ) by A x , ( 2 ) by A 2 , ( 3 ) by A 3 , 
and adding, leading to 


(dZjv 4 j a 2 A 2 "l - a 3 Afjx "l - i^b x A x ~ j - b 2 A 2 H - b 3 Afy 

+ (c x A x + c 2 A 2 + c 3 A 3 )z = d x A x + d 2 A 2 + d 3 A 3 . 
In view of the properties of cofactors (see 13.5), we know 
a x A x 4 - a 2 A 2 + a 3 A 3 — A, 

while the coefficients of y and z are zero because the elements of columns 2 
and 3 have been combined with ‘ alien ’ cofactors, those of column 1. Thus 
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y and z have been eliminated in a single tour deforce ! The expression on the 
R.H.S. has the value 


d\A^ + d 2 A 2 “b d 3 A 3 


d x b 1 <4 

d 2 ^2 ^2 

^3 ^3 ^3 


det (d, b, c), 


where det (d, 6, c), which we will call A t , is simply the determinant A 
with column 1 (the col. vector a), replaced by the col. vector d. 

Thus Ax — A j; and similarly we obtain 

Ay — A 2 , 

where A 2 is A with its second column replaced by the column vector d\ 
and finally, 

Az = A 3 . 


So 


Ax = Aj, Ay = A 2 , Az = A 3 


If we make use of the matrix 



and its adjoint matrix M* (see 13.4), the foregoing proof can be consider¬ 
ably shortened. For the original equations ( 1 ), ( 2 ), ( 3 ) can be written in 
matrix form as 


Mx — d 

=> M*Mx — M*d 

=> AIx = M*d 

=> Ax = M*d 



So, again Ax — A x , 



Ax = A x d x + A 2 d 2 “b A 3 d 3 
=> Ay = B^d^ -j- B 2 d 2 ~b ^3^3 
Az — -|- C 2 d 2 ~b C 3 d 3 


Az = A 3 . 


This solution in determinant form is known as Cramer's rule and is 
clearly capable of generalisation to any number of equations. But it 
suffers from the very serious disadvantage that evaluation of determinants 
is usually extremely tedious. 
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Example i 
Solve the equations 

x + y — z = 1 
x — y + z = 2 
2x —2 = 4. 

1 1-1 

Here A = 1 -1 1 = 1(1) - l(-3) - 1(2) = 2 

2 0-1 
1 1-1 

Aj = 2 — 1 1 = 1(1) - 1 (— 6 ) - 1(4) = 3 

4 0-1 

1 1-1 

A 2 = 1 2 1 = 1 (— 6 ) - 1 ( —3) - 1 ( 0 ) = -3 

2 4-1 
1 1 1 

A 3 = 1 — 1 2 = 1 (— 4) - 1 ( 0 ) + 1(2) = -2. 

2 0 4 

So 2x = 3, 2y = — 3, 2z = — 2 

=> x = §, y — —f, z — — 1 

(as we could have found more simply by elimination!). 

Example 2 

Solve the homogeneous equations 

a x x + b x y + c x z — 0 
a 2 x + b 2 y + c 2 z = 0 
a 3 x + b 3 y + c 3 z = 0 . 

Here it is immediately clear that 

A x = A 2 = A 3 = 0 => Ax = Ay = Az = 0 . 

So if A ^ 0 , it follows that the equations have a unique set of solutions 
x = y = z = 0 ; in other words, if there should exist a set of solutions 
which are not all zero, then we may be sure that A = 0 . 

Conversely (see exercise 13 . 6 a, no. 8 ), it can be shown that if A = 0 , 
then it is possible to find values of x, y , z which are not all zero satisfying the 
original equations. 
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So 


The existence of values x, y , 2 not all zero 
satisfying the homogeneous equations 


Regarding these three equations as representing three planes through 
the origin O, we see that they also have some other point P in common if, 
and only if, they intersect in at least a line (OP), in which case their equa¬ 
tions are linearly dependent and A = 0. 

But this important result may also be reconsidered in several other 
ways: 

First, A = 0 is the condition for the three straight lines a t x + b$ + c { = 0 
(i = 1 , 2 , 3 ) to be concurrent (see 13.5, p.425). 

Second, A = M = 0 is also the condition for the matrix M to be singular 
(see 13.4), that is, for the non-existence of an inverse. 

Third, in the context of the system of non-homogeneous equations 

Mx = d {d * 0 ) 

is the condition for the system to be non-regular, and this we shall deal 
with in 13.7. 

Fourth, \M\ is the value of the scalar triple product a. (b X c) which 
represents the volume of the parallelepiped formed by the vectors a, 6, c 
as adjacent edges (see 13.2). Hence \M\ = 0 is the condition for its volume 
to vanish, and so for the vectors a, 6, c to be linearly dependent. And, 
similarly, it is also the condition for the three row-vectors 

(a h b h c t ) (i = 1 , 2 , 3 ) to be linearly dependent. 

Example 3 

It is also useful to see how Cramer’s rule can be obtained by vector methods. 
For equations ( 1 ), ( 2 ), and ( 3 ) above may be written in vector form as 

xa 4 - yb + zc = d. 

Remembering that vector product distributes over addition, we take the 
vector product of both sides X b , giving 

xaXb + ybxb-\-zcXb = dxb. 

But b X b = 0 . 

Hence xa X b + zc X b = d X b. 

Now take the scalar product of both sides with c, and again remember 
that scalar product distributes over addition. 

Then x[a X b).c + z{c X b).c = (d X b).c. 

But (c X b) .c == 0 . 
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Hence x(a X b).c = (d X b).c. 

But a X b ~ (a 2 b 3 — a 3 b 2 )i 4 ( fl 3^i — a \b 3 )j + ( a \b 2 — a 2 b^)k 
and c = c x i 4 ctf 4 c 3 k. 

So (a X b).c = {a 2 b 3 - a 3 b 2 )c l + (a 3 b 1 - a i b 3 )c 2 4 ~ ^1)^3 

— ^2 ^2 ^ ? 

^3 ^3 ^3 

b j C j 

and similarly, (d X b) .c = d 2 b 2 c 2 = A x . 

<af 3 Z > 3 r 3 

So Ax = A 1} and similarly Ay = A 2 , Az = A 3 . 

Exercise 13.6a 

1 Use Cramer’s rule to solve: 

(i) 2 x — 5 y 4 3 = 0 (iz) 2x — 3 y — 1 — 0 
3 x 4 y — 11 = 0 ; 4 x — 6y 4 5 = 0 ; 

(Hi) lx + my + nz = 0 
mx — ly 4 pz = 0. 

2 Use Cramer’s rule to solve the simultaneous equations: 

(*) x — z = 4 (ii) 4 x — y — 2 z = —7 

2 x 4 - y — 3 z = — 2 3 x 4 5 y — 6z = 4 

4 x 4 2 y 4 5 z = — 3 ; 6x — 13 y 4 - 6z = — 29 . 

3 Use Cramer’s rule to solve for cos 9 and sin 9 in terms of b , c, /?, y, 
and r, and hence eliminate 9 : 

<2 cos 0 + £ sin 9 = c 
p cos 9 + q sin 9 = r. 

4 Obtain the condition for the quadratic equation <2x 2 4 bx 4 c = 0 , 
px 2 4 qx 4 r = 0 to have a common root by first solving these as a pair 
of simultaneous equations in x 2 and x, using Cramer’s rule, and then 
eliminating x. (Compare exercises 13.5a, no. 8 and 13.5b, no. 10.) 

5 Find the condition that the equations 

ax 4 by 4- cz = 0 
bx 4 cy 4 az = 0 
cx 4 ay 4 bz = 0 

have a solution other than (x, y, z) = (0, 0, 0). 
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6 Find the values of X for which the equations 
4 x — 6 y — z = Xx 

x — \y — z = Xy 
2 * + 3 y -\- z — Xz 

have a solution other than x = y = z = 0, and find the ratios x\y\z for 
each of these values of X. (o.c.) 

7 If co denotes one of the cube roots of 1 other than 1 itself, solve the 
equations 

* + y + z = a 
x + coy + co 2 z = b 
x + co 2 y + c oz ~ c 

for x , y, z in terms of a, b, c. Give your answer in as simple terms as pos¬ 
sible. (s.M.P. ) 


8 Consider the three homogeneous equations 



Prove that, if solutions other than x = y = z = 0 exist, then A = 0 (the 
3 x 3 determinant of the coefficients). 

Prove conversely, that if A = 0, then solutions other than the trivial ones 
may be found. 


(Note that Ax = Aj =0, etc., and that if A = 0 , then any arbitrary 
value of x will satisfy. Note also that 



9 Prove that if a, ft are real numbers and a /0, then the equations 
x + y + 2 z = 5 

2 x + y + 2 = 2 
3 x + y + ocz = 

have a unique solution. Prove further, that if a = 0 , then these equations 
have a solution only if p takes one particular value, which should be found. 
Find all the solutions of the equations in this case. (o.s.) 

10 Given that A; is a real constant, solve completely, when possible, the 
simultaneous equations 
kx 4- 2 y + 8z = 0 

x — y + 2 kz = 0 
x -P y + 6z = 2 k 

discussing carefully any special cases. 


(o.s.) 
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11 Three planes through the origin have equations 

n! ax + hy + gz = 0 
n 2 hx + by + fz = 0 
n 3 gx +fy + cz = 0. 

Let r r , r = 1 , 2 , 3 , be the three planes defined as follows: T t is the plane 
through the intersection of Yl 2 and n 3 , and also containing the x-axis. 
T 2 and T 3 are similarly defined. Prove that T x , T 2 , T 3 have a common 
point. 


Regular System — 4 Ill-conditioned ’ case 

We know that in the case A # 0 our original system of equations will have 
a unique solution. Note, however, that if A is very small compared with 
A 1? A 2 , and A 3 , the values of *, y, z may be large: the point of inter¬ 
section of the three planes is a long way off, if we think in terms of the first 
interpretation of p. 418; or the matrix is nearly singular, in the second inter¬ 
pretation ; or the 3 vectors a, b , c are very nearly coplanar in the case of the 
third. 

In such a situation, the equations (and the matrix M) are described as 
ill-conditioned . This means that the solution is very sensitive to changes in 
the data: move one of our three planes by a very slight alteration of perhaps 
just one of its coefficients, and the common point of the three planes may be 
moved a long way. 


The singular case , in terms of Cramer's rule 

In a non-regular set of equations, where A = 0 , we know that no finite 
solutions are possible in x, y , z unless all three of the determinants A x , A 2 , 

A 3 are zero. This may happen, of course by having d — 0, i.e., d l = 0, 

d 2 = 0, and d 3 = 0. 

In the more general case when d ^ 0 , it may be wondered whether it 
is possible for A x to be zero without A 2 or A 3 necessarily being zero. At 
first it seems that this is not possible, for one may argue as follows: 

Firstly, A = 0 implies that a, 6, c are linearly dependent, 

Aj = 0 implies that d , 6, c are linearly dependent. 

Hence it appears that a, 6, d are linearly dependent, so that A 3 is also 
zero, and similarly A 2 = 0. 

Alternatively, we may think of the three planes represented by the three 
linear equations. Now since in any case A = 0, we know the three planes 
either have a common line or else form a triangular prism, including the 
case when some pairs are parallel. Consider now the lines in which the 
planes intersect the plane z = 0. 
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They have equations 


a \ x + b t y = d x ' 


a x b x d, 

a 2 x + b 2 y = d 2 

► and the vanishing of the determinant A 3 = 

a 2 b 2 d 2 

a 3 x + b 3 y = d 3> 


a 3 b 3 d 3 


ensures that these three lines have a common point (taking the more 
general case when they are not all three parallel). In such conditions, the 
three planes do not form a triangular prism, but will have a common line. 
This means that they also cut the plane y = 0 in three concurrent lines, 
whose equations are 

a { x + c l z = d x 

a 2 X I C <2%, — 

&$X + C 3 Z — d 3 

and whose determinant should therefore vanish. This would give the result 
A 2 = 0 , and similarly we should have A 3 = 0 . 

However, the above arguments overlook the possibility that A and A t 
vanish as a result of the two vectors b and c being linearly dependent. In 
such a case, a and d can be freely chosen as any two vectors, so it would 
clearly be possible to avoid the vanishing of either A 2 or A 3 , or of them both. 


Exercise 13.6b 

1 Consider the lines 

2 x + 3 y — 4 = 0 
6 x + ky — 2 = 0 

in the case when k = 9 , and when k = 9 + < 5 , where S is very small. 

2 Find k so that the planes 
2 x — 3 y + z = c 

3 x — y = —4 
x + ky + 2 z = 3 

may form a triangular prism in general. 

When k has this value, find c so that the planes may have a common line, 
and check that, in this case, all four of the 3 x 3 determinants of the 3 x 4 
matrix (the ‘augmented matrix’ of the set of equations) are zero. 

In this case, find constants />, <7, and r, such that: 

p{ 2-3 1 c) + q( 3-1 0 - 4 ) + r{\ k 2 3 ) = (0 0 0 0 ). 

In the case c = 1 , find the effect of changing k slightly from its critical 
value. 
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3 (z) Find the values of A and for which the equations 

Xx + y = 0 
x + y — z = 0 
2x + 3y - 2z = 0 

have a solution other than the trivial solution x = y = z = 0 . 

(ii) For what values of X do the equations 
Xx 4- y = 1 
x 4- Xy — z — 1 

2x + 2y — 2z = 1 have a solution? (o.s.) 

4 Find a vector which is perpendicular to each of the following pairs: 

(0 (2 1 0 ) and (-2 0 3); 

(ii) (1 4-1) and (-1 2 0); 

(Hi) (cos a sin a 0 ) and ( 0 , cos /? sin /?). 

* 13.7 Systematic reduction 

Three linear equations with three unknowns 

We now deal with the case of three simultaneous linear equations in the form 
(i) a x x 4- b x y + c x z = dA 


a 2 x 4 - b 2 y 4 - c 2 z = d 2 r 



Let us consider now the threefold*]* interpretation in this case, just as we 
did with two equations (see 13 . 4 ). 

In the first place, as an equation such as 2x — y 4- 5z = —3 connecting 
the coordinates (x, y , z) of a point in three-dimensional space represents a 
plane , our first interpretation concerns three planes in space; and in the 
general case (when the equations are regular), we expect there to exist a 


| In due course it will become apparent that the second and third interpretations are really 
the same, but expressed differently. The similarity of these to the first interpretation is not 
however so apparent, cf. 13.4, p. 417. 
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unique point common to the three planes, as for example the three planes 
which intersect at the ('trihedral 5 ) vertex of a cube. The non-regular 
cases, when there is something unusual, will be considered presently. 

In the second interpretation, we require to find which vector is transformed 
by the matrix M into the given vector rf, and in the general case this prob¬ 
lem again has a unique solution. 

Thirdly, we are attempting to find what multiples of the given vectors 
a , 6, c must be 'mixed 5 together in order to obtain the vector d ; and if all 
is well, there will again be a unique answer. 

Whenever the matrix M is non-singular, i.e., possesses an inverse, the 
unique solution is applicable. For then 

Mx = d => M~ l Mx = M~ l d , 

so that x = M~ l d is the unique solution. 

The question of actually finding the inverse of a 3 x 3 matrix was dis¬ 
cussed in 13 . 4 , and a second method will emerge in this section (p. 455 ). 
But in numerical cases, even with the facility of a computer, it is usually 
simpler to use another method, rather than to compute the inverse. 
Similarly, a problem in mechanics might lead to the following set of 
equations, in which T, S represent unknown tensions, while a, b , and c 
represent unknown accelerations: 

m 1 g — S = m 1 a; m 2 g — T = m 2 c ; c = 2a — 2b; 
m 3 g — 2T = m 3 b; S — 2T = m 4 a. 

In such a case, the values of the unknowns are best found by ‘ common- 
sense 5 elimination; it would be absurd to force the problem into the 
format of a 5 x 5 matrix with a very large number of zeros. 


Echelon form 

Suppose we are asked to express 

2 r 3 — 5r 2 — 3r + 11 in the form 

Ar(r + l)(r + 2) + Br(r + \) + Cr + D. 

Put r = 0, 11 = D (1) 

r — — 1 , 7 = -C + D ( 2 ) 

r = —2, -19 = 2B — 2C + D (3) 

r = 1, 5 = 6A + 2B + C + D (4) 

Equation (1) gives immediately D = 11 ; substitution into (2) gives C = 4. 

Knowing C and D , (3) gives B = —11, and finally (4) enables us to find 

A = 2. 

Thus 2r 3 - 5r 2 - 3r + 11 = 2r(r + l)(r + 2) - llr(r + 1) + 4r + 11. 
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Now these equations were particularly easy to solve because of the form of 
the 4x4 matrix 



where it will be noticed that there is a triangle of zeros above the N.E.—S.W. 
diagonal. Such a matrix is said to be in triangular , or echelon , form. Had we 
written the four equations in reverse order, we should have had 



The latter is a more usual form for triangular matrices — we prefer to have 
all the zeros either (as in this case) below the leading diagonal (N.W. to S.E.) 
or above it, thus: 



or 



(leading diagonal marked 
in each case) 


( 1 lower diagonal form) 


( 3 upper diagonal form) 


Reduction to echelon form 

If we start with a set of linear equations, we may attempt their solution 
by trying to obtain from them an equivalent set of equations whose matrix is 
triangular. The evaluation of the several unknowns will then be a simple 
matter, as in the example above. The method of reduction to such an 
echelon form , resembles the method used in elementary algebra: 


Example i 
Solve the equations 

v — z =4 (1) 

2x + y + 3z = -2 (2) 

4x + 2y + 5z = —3 (3) 

Multiply equation (1) by 2, and subtract from equation (2). 

Also, multiply equation (1) by 4, and subtract from equation (3). 
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These operations we shall abbreviate respectively: (2) — 2 x (1), 

and: (3) - 4x (1). 


Then 


x — z — 4 

< y + 5z = — 10 
2y + 9z = -19 
(x - z = 4 

\ y + 5Z = -10 


( 1 ) 

(5) = (2) —2 x (1) 

(6) = (3) —4 x (1) 

( 1 ) 

( 5 ) 

(7) = (6) —2 x (5) 


The equations are now in echelon form, and by ‘going up the ladder’ 
(echelon = ladder), we obtain successively z = — 1, y = —5, x = 3. In 
this particular example, there was a short cut, which we have ignored, 
namely (3) — 2 x (2), which leads to z = — 1 straight away, but we must 
be prepared for less fortunate cases! 

Reminding ourselves of the notation introduced in 13 . 5 , the working 
can now be condensed: 


r 2 = r 2 - 2 r l 
r' 3 = r 3 - 4 r 1 


r' 3 = r 3 - 2r 2 



0 

1 

2 


-1 

5 

9 


-10 

-19 



corresponding to equations (1), (6) and (7) above. 


We note the three interpretations of the solution (x, y, z) = (3, —5, — 1): 
(i) the three planes intersect at the point with coordinates (3, — 5, — 1); 

or (ii) in the linear transformation of three-dimensional space represented 
by the matrix 


1 

2 



the vector 



arises from the vector 



or (m) Letting a 
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we have found that 
d = 3a — 5b — c, 

so that d can be expressed as a linear combination of a, 6, and c. 

Example 2 

4x — y — 2z = —7 
3x + 5y — 6z = 4 
x — 3y — 4z — —9. 

We show the matrix of coefficients at successive stages, and on the right 
give the values of the row sums (see below) 



This time we are going to obtain an ‘upper’ triangular form. 



It remains to get a zero as the second element in the first row without dis¬ 
turbing the two zeros already obtained. 

r\ = 19rj - r 2 /130 0 0 —130\ 0 

( 3 19 0 35 57 

\ 1-3-4 -9/ -15 


The equations are now in echelon form: 130* = — 130 

3* + 19 y — 35 
* — 3y — 4z = —9 

and so * — — 1, y = 2, z = f. 

Note that r[ = 2 r 1 — r 3 is equivalent to the two steps r[ = 2r t (i.e., 
multiply throughout equation (1) by 2), followed by r\ = r x — r 3 . We 


might have used r\ = r l — \r 3 , but this would have caused the work to 
become encumbered by fractions. On the right, a tally is shown of row 
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sums, providing a useful check on accuracy, which is always used in 
practice. For example, when we applied r 2 — 2r 2 — 3r 3 , the sum of the 
newly formed second row (57) does satisfy the relation: 

57 = 2 x 6 — 3(—15). 


Exercise 13.7a 

1 Solve the following sets of equations using the method of the above 
section, showing the row (or column) operations, and keeping a check of 
row sums: 


(i) 2x -F 2y — 3z — 0 (ii) 2x 
3x + 2y — lOz = 7 4x 

3x -|- 4 y -\~ z = 8 * 3x 

(Hi) x + y + z = 1 

x — 2 y + z = —2 

2x + 3y — 2z = — 1; 



ly + 4z = 2 

+ 2z = -3 
2y + 7z = 1; 



2 Find numbers/?, <7, r such that: 



3 Solve the homogeneous equations for the ratios x:y:z:w: 

x + y — z+ w = 0 
2x — 3y + 4z — Sw — 0 

x + 2y + 5z + 3w = 0. 

4 Let A stand for the matrix 
/17 -4 0\ 

j 23 49 3 1 
^18 13 1 J 

Find two matrices M l , M 2 such that: 

(i) each has only one non-zero entry off the leading diagonal; 




13-7 SYSTEMATIC REDUCTION 453 

(ii) the product M 2 M X A has zeros above and to the right of the leading 
diagonal, so that its final form is 
/. 0 0 

v ° 

where the places indicated by the dots are occupied by numbers. 

From your result, deduce the value of the determinant of A. (s.m.p.) 

5 Describe a method of finding the solution of three simultaneous linear 
equations and illustrate your answer by solving the system: 

3.9* + 2 .by - 0.5z = 4.3 
5.2* + 2.9 y — 1 . 6 z = 10.9 
2.6* + 1.8 y — OJz = 2.8 

working to three significant figures. (m.e.i.) 

6 Use the method of pivotal condensation,f with a sum check, or any 
other suitable method, to find the values of *, z/, z from the following 
simultaneous equations. Tabulate the details of your working, and give 
results to three significant figures (you may use a slide rule). 

9.37* + 3.04y - 2.44z = 9.23 
3.04* + 6.18z/ + 1.22z = 8.20 
- 2.44* + 1.22 y + 8.44z = 3.93. 

Find the residual when your results are substituted in the second equation. 
Comment on the value of this residual as a guide to the accuracy of your 
results. (m.e.i.) 


Identity matrix by using row operations 


It is possible to proceed still further in this process. Reverting to Example i, 
and resuming from the point we reached when triangular form was 
achieved: 


r 'i = r i ~ r s 


r 2 = r 2 + ^ r 3 


0 

b 


0 -1 
1 5 

0 -1 
0 0 
1 5 

0 -1 
0 0 
1 0 
0 -1 



4 

-4 

0 

4 

-4 

0 

4 

-4 

0 


| i.e. by reduction to echelon form. 



454 MATRICES, DETERMINANTS, AND LINEAR ALGEBRA 


Finally 


r 3= ~ r 3 gives 


( 1 0 0 3 ' 

0 10-5 

0 0 1 - 1 , 


4 

-4 

0 


and it will be seen that we have effectively succeeded in setting up the 
identity matrix on the left-hand side of the equation. However, it is not 
necessary to go beyond the point where the echelon form is obtained. 

It is instructive to follow through the above working using elementary 
matrices. In Example i, the successive operations 


r 2 = r 2 ~ 2rj , r' 3 — r 3 — 4 r 1 , and r' 3 = r 3 — 2r 2 

are carried out by using 

1 0 0 \ / 1 0 0 \ /1 0 0 

■210,01 0 and 0 10 

0 0 1/ \—4 0 1 / \0 -2 1 

as pre-multipliers. Their combined effect is the product 

T 0 0\ / 1 0 0\ / 1 0 0\ / 1 0 0 

0 1 0 I I 0 1 0 (-2 10=1-2 10 

V0 -2 1/ \—4 0 1/ \ 0 0 1/ \ 0 —2 1 

and when this pre-multiplies the original matrix, we obtain 

( 1 0-1 
0 1 5-10 

0 0-1 1/ 

corresponding to the result above. 

/ 1 0 °\ 

Thus the matrix 1—2 1 0 I pre-multiplying 

\ 0 -2 1 / 

changes it into lower triangular form. 

Proceeding with the three succeeding transformations, 






r 2 — V 2 + 5r 3 , 


their combined effect is given in a similar way by the product of the ele¬ 
mentary matrices: 
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( 


1 0 0 

0 1 0 

0 0 -1 






The combined effect of all six matrices is given by 



This, pre-multiplying the original equation, gives: 


( 


1 2 
-2 -9 
0 2 




( 1 0 0 3' 

0 10-5 

0 0 1 - 1 , 


It will therefore be seen that, by making use of the six elementary matrices 
we have succeeded in showing that 


( 


1 2 
-2 -9 
0 2 



is the inverse of 



Formation of inverse matrix by row operations 


The above process gives a clue to the formation of the inverse of any non¬ 
singular 3x3 matrix, starting from the unit matrix and applying the 
appropriate row operations in turn. These are shown in the centre, with 
the corresponding matrices on each side: 


original 
matrix —> 



( 

( 


1 

0 

4 

1 

0 

0 



r 2 - 2 r x 


r 3 “ 4 r x 



0 

1 

0 

0 

1 

0 

0 

1 

0 



unit 

matrix 
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echelon 
form —► 


unit 

matrix —> 



inverse 

matrix 


In the process of forming the inverse, it is not necessary to pass through 
the echelon stage, so long as one contrives to produce zeros in the correct 
positions. Moreover, the process involving row operations could be replaced 
by one using column operations , but row and column operations must not be 
mixed, for column operations are equivalent to jfro^-multiplication by 
elementary matrices. 

This method of inverting matrices, by using either row operations or 
column operations, is basically the one used in practice, e.g., by computers 
in the case of larger matrices, the elements of which are possibly awkward 
numbers. Even using a computer, the method is a great deal shorterj than 
that of forming the adjoint matrix, which is chiefly of theoretical interest. 


Exercise 13.7b 


1 Find the inverses of the following square matrices, using the method of 
the preceding section: 



/ 2 2 -3\ 

(tv) 3 2 -10 1; 

\3 4 1/ 

h 1 ->\ 

(vn)\ 1 2 3 1; 

\l 3 -1/ 




/' 1 

(m) 1-2 

\2 3 



| See Neill, H. and Moakes, A. J., Vectors, Matrices and Linear Equations, Oliver and Boyd, 
1 967> P- 157 - 
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2 Use the results of no. i above to solve the equations in exercise 13 . 7 a, 
no. 1 . 

(Use row operations in most cases, but try using column operations in 
others.) 

(' 1 

3 Find the inverse of the matrix M — 13—9 5 

\l -3 3 

Rewrite the set of equations x — y + 2 = 3 

3* — 9y + 5z = 6 

x — 3y 3z = 13 

in a form using the matrix Af, and hence solve the equations. (m.e.i.) 

4 Use the method of this section to find the inverses of some of the matrices 
in exercise 13 . 4 a. 

5 Construct some examples of singular 3x3 matrices: 

(i) by arranging for the 3 x 3 determinant to have the value zero. 

(ii) by arranging the third row to be linearly dependent on the first two 
rows (i.e., r 3 = ar i + br 2 , see p. 430 ). 

6 Express the following simultaneous linear equations in matrix form 

x -f y 4 " 2 2 = 4 X -\- Y -Z = 3x 

y - 3z = -2 X - 2Y + Z = 3y 

2x + 5y — 5z = k Y — Z = 2 

Find the value of k for which they are consistent, and obtain the general 
solution for X, Y, Z in that case. Give a geometrical illustration. (m.e.i.) 


7 Solve where possible the equations 



8 What is the effect of the singular matrix 

/A 



on the points 


2 


of 3-D space? 
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9 Find a 3 x 3 matrix which takes all the points of 3-D space into the 
plane 2x — \y + z = 0. 


10 Find a 3 x 3 matrix which takes all the points of 3-D space into the 
line of intersection of the planes 2x — + 2 = 0 and a + y — 4z = 0. 


/ 2 -3 

11 Show that the matrix | 4—6 

\-3 H 

takes all the points of 3-D space into the line x:y:z = 2:4:—3. What do 
you notice about the (?) minors, (ii) rows and columns? 


*13.8 Further applications of matrices 

There are many applications of the use of matrices throughout mechanics 
and other branches of applied mathematics but we shall concentrate our 
attention on a group of problems involving the impact and collisions of a 
particle. 

Example 1 

The bouncing of a particle on an inclined plane 



Consider a projectile launched from a point Oj so as to land on a plane 
through 0 1 inclined to the horizontal at an angle a. If the magnitude and 
direction of the initial velocity are known, it is possible to calculate the 
range r, that is the distance between the point Oj and the point 0 2 where 
the projectile hits the plane. It is convenient to work with the components 
of the initial velocity along the plane and normal to it. Calling these u and v, 
it can be shown that 

2 ^ / 

r = - (u — v tan a). 

g cos a 

Of course, if u < v tan a, r would be negative, and you should have no 
difficulty in appreciating the meaning in this case. 

Next suppose that the ball bounces after landing at 0 2 after which it 
proceeds along another parabolic trajectory under gravity, and again hits 
the plane at a point O s , whereupon it rebounds and the process continues. 
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We may wish to compute the position of the successive bounces, and to 
know just what happens in the subsequent motion. 

The key to the problem is that we should like to be able to obtain the 
components (along and perpendicular to the plane) of the velocity of the 
ball as it sets out on each of its respective trajectories. For, once we know 
these, the above formula enables us to find where it lands at the end of that 
trajectory. 

If the velocity just after the first bounce has components u x , v x , it is 
easy to show that 

u x — u — 2v tan a 


v x = ev, 

where it is assumed that the plane is smooth, and that there is a coefficient 
of restitution e (0 < e ^ 1) governed by the elastic properties of the 
materials. These equations may be written 


1 — 2 tan a 

0 e 


so that we see the matrix M = 


— 2 tan a 


transforms the initial 


velocity represented by a column vector into the velocity vector following 


the bounce. This we may write v x = Mv. Similarly, if v 2 = 


is the 


velocity immediately after the second bounce, we have v 2 = Mv x . Com¬ 
bining these, v 2 — MMv — M 2 v , giving the velocity after the second 
bounce in terms of the initial velocity. Similarly, v 3 = M 3 v is the velocity 
after the third bounce, and v n = M n v is the velocity after the nth rebound. 

Thus, to find out what happens to the ball after repeated rebounds from 

, , , . , , , . i i . (1 — 2 tan 

the plane, we need to be able to multiply the matrix 


\0 e 


by itself repeatedly, i.e., we need to find M z \ M , A/ 4 , . .., or ‘powers’ 
of the matrix M. As an example, consider first the case when tan a = 
e = 1 (perfect elasticity). 


Then 




M 3 = 


and it is easy to see inductively that 




Thus if the initial velocity were given by the vector v 



we have 


v x = Mv = 


11 

4 


= m 2 v = rj; 


v 3 = M 3 v = ( ^ 
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»«- - ( 4 ‘) 

The range formula gives successive values of r to be: 

r i = 13A;; r 2 = 9k; r 3 = 5k; r 4 = k and so on, where 

k = 2v - iQ 

g cos a 9.8 x 2/y/5 

Evidently, after the third bounce, the ball starts coming down the plane. 
Had the coefficient of restitution been smaller (say £), our matrix would 
have been 

M= (o i) withM2= (i I 2 ) m3= (o >j’ andsoon ' 


In this case, with initial velocity the same, we obtain 



and successive values of r: 


r 1 = 13A;; r 2 = 10£; r 3 = 8 jk; r 4 = 7| k,. . . . 

Here we may well ask whether the value of r ever becomes negative — 
whether the ball ever leaps down the plane — before bouncing ceases for 
ever. 


Example 2 

Collision of elastic particles moving in a line 

We consider next another situation in mechanics, that of a number of 
billiard balls moving along a straight line, with impacts occurring between 
pairs of adjacent balls. We shall suppose that the balls are all of equal 
mass m, and that the coefficient of restitution between any pair is e. Suppose 
the two balls have speeds u x ,u 2 before impact and , v 2 after impact: 


Then, by conservation of momentum: 
mv x + mv 2 = mu x + mu 2 
and by Newton’s law of restitution: 
v 2 — v x — —e(u 2 — Uy). 
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Example 3 



u 


Vl 


w 1 


Repeated impacts of three balls moving in a line 


Suppose now we have three balls, A, B, and C of equal mass moving in a 
straight line, and that A first strikes B, after which B strikes C. We shall 
suppose the speeds to b eu 1 ,v li w 1 initially, u 2 ,v 2 , w 2 after the first impact 
between A and B, and finally w 3 , t/ 3 , w 3 after the second impact. We have, 
as in the previous section: 


_ 1 


2(1 — e)u x 4“ 2"(1 4- v)v\ — pu x 4~ qv x 
v 2 — |(1 4- e)u { + ^(1 — e)v x = qu i + pv x 

W 2 — w 1 


For the impact between B and C we have similarly, 


u 3 = u 2 

v 3 — 2(1 “ e ) v 2 "h 2"(1 + e)w 2 — pv 2 + qw 2 
w 3 = ^(1 -f e)v 2 4- ^(1 — e)w 2 = qv 2 4; pw 2 



Ip q °\ Ai\ 

y P ° I \vi V 

yO 0 1 j \ w l J 

( 1 0 0 \ / u 2 \ 

0 p ? II t> 2 )• 
0 q p) \w 2 J 


Abbreviating the notation: 

v 2 = M x v x 
v 3 = M 2 v 2 

and these when combined give the final velocities in terms of the initial 
ones: v 3 — M 2 M x v x , where the product matrix M 2 M X will be found to be 

ip q °\ 

[pq P 2 q I 
\q 2 Pq PJ 

This matrix describes the final result after the two successive collisions. As 
expected, M 2 M X ^ M X M 2 , so the final result would be different if the 
collision between B and C had preceded that between A and B. 


Example 4 

Perfectly elastic collisions 

The case e = 1 deserves special attention. Here we have 
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/° 

1 

°\ 

/' 

0 

°\ 

h 

0 

°, 

o 

II 

CM 

§ 

0 

■ 

\0 

0 

i / 

\o 

1 

0 / 


and these are permutation matrices, confirming the fact that the pairs of 
colliding balls exchange velocities upon impact. The diagram 

ABC 



shows the velocities after successive collisions between A and B, B and C, 
and a further collision between A and B, in the case when u l > v x > w x . 
Evidently no .further collisions can occur because after the third impact 
both pairs of adjacent balls are separating, with u x > v x and v x > w 1 . 
It is interesting to extend to the case when there are 4 (or more) perfectly 
elastic balls moving in a line. The matrices will then be 


( 0 1 0 
1 0 0 
0 0 1 
0 0 0 



for an impact between A and B; 




for an impact between B and C; 


and 


( 1 0 0 
0 1 0 
0 0 0 
0 0 1 



for an impact between C and D. 


The order in which impacts occur will depend upon the initial position 
and initial velocities of the 4 balls. If 6 impacts occur in the sequenc 
AB, BC, CD, BC, AB, BC, the velocities follow the sequence: 
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the final velocities z, w , v, u being the original ones in the reverse order. This 
is shown graphically in the figure where a displacement time graph is 
given for each ball. At each impact, the two bodies ‘swop graphs’. Clearly, 
since four lines intersect in at most six points, there can be no more than 
six impacts under the most favourable conditions, with u > v > w > z. 
This may obviously be extended to the case of five balls moving in a line, 
when the maximurq, number of possible collisions will be 10 (— 5 C 2 ). 

Example 5 

As a final example, which may be answered by the use of matrices and 
which would be very difficult by any other means, suppose one considers 
the following problem. 

A body is first given a rotation through 180° about an axis, and then a 
quarter-turn (through 90°) about a second axis, the two axes intersecting 
at a point O and making an angle 6 with each other. It seems a reasonable 
assumption that these successive rotations may be replaced by a single 
rotation about some other axis through O. We now ask whether it is 
possible for the angle 0 between the axes of rotation to be chosen in such a 
way that the single rotation will be through + 120°. In other words, can a 
half-turn, followed by a quarter-turn, ever lead to a one-third turn? 

There are probably not very many people who can visualise the solution 
of such a problem, as it would require much more three-dimensional 
insight than any but the most gifted possess. Even if one were allowed to 
use apparatus — rods, matches, pieces of string, cardboard, etc. — one 
would still be hard put to it to obtain a solution. The reader, trusting his 
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intuition, may well wonder whether the problem is soluble, and may like 
to perform experiments to discover whether his intuition has misled him 
before reading on. 

Using matrices, however, the solution is comparatively simple. We are 
concerned with rotations about two axes intersecting at an angle 0. We 
therefore choose our coordinate system so that the origin is the pdint of 
intersection of the lines, one of them is the axis of x, and the other lies in the 
plane of the x- and y-axes, i.e., the plane z = 0, so that it has equations 
z = 0, y = x tan 0. 


y 


Our first rotation, through 180°, will be about this axis, and in order to set 
up the matrix, we must enquire the destinations of the points U, V, W 
under this half-turn. Clearly the point W' has coordinates (0, 0, —1), so 
the third column of the matrix is determined. 




A 4 plan ’ view on to the xOy plane shows that the half-turn takes the point U 
to U' with coordinates (cos 20, sin 20, 0), while V' (remembering that 
Z_U'OV' = 90°) will be found to have coordinates (sin 20, —cos 20, 0)4 


f Compare the matrix for the two-dimensional reflection in a line through the origin; see 13.2. 
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Thus the matrix representing the half-turn is 


( cos 20 sin 20 
sin 20 —cos 20 

0 0 



Since our second rotation is about the x-axis, it will be easier to find the 
matrix, but we must specify in which sense the quarter-turn is to be carried 
out (this did not matter in the case of the half-turn, of course). If the sense 
of the rotation is that of a right-hand screw in the positive direction of the 
x-axis, i.e., such as to carry the point V to move into coincidence with the 
point W, then we can easily see that U' is (1, 0, 0), this point being invariant 
under the rotation, V' is (0, 0, 1), and W' is (0, —1,0). Thus our second 
matrix is 

/, 0 ox 

B = I 0 0 -1 . 

\o 1 0/ 


The combined effect of the two rotations will be represented by the product 
matrix BA (first A, then B , since the matrices are pre-multiplying column 
vectors), and we have 


BA 


0 

0 

1 



sin 20 
— cos 20 
0 


0 N 

0 


- 1 / 


cos 20 

sin 20 

°\ 

0 

0 

1 )= c 

sin 20 

— cos 20 

0/ 


The question is: can C conceivably represent a third-turn, i.e., a rotation 
through ± 120°? In order to answer this, we first consider the matrix C 2 , 
which should represent a rotation about ±240°: Writing c = cos 20, 
s = sin 20, 



Now if C is to represent a 120° rotation, C 3 must be the identity matrix , and 
the equation C 3 = I would mean that the nine equations: 



13-8 FURTHER 

c 3 + s 2 = 1 (i) c 2 s — cs 

cs = 0 (w) s 2 

c 2 s — cs = 0 {vii) cs 2 + c 2 
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= 0 

(«) 

cs 

= 0 

{in 

= 1 

(v) 

— c 

= 0 

(vi] 

= 0 

{viii) 

s 2 

= 1 

(ix) 
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ought all to be satisfied. It may well be that there does not exist an angle 9 
which satisfies all these equations. However, our fears on this score are soon 
dispelled. The key to the solution is equation {vi), requiring that c = 0, 
in which case (ii), {Hi), {iv), {vii) and {viii) are automatically true. The 
remaining three equations (i), {v), and {ix) then all reduce tos 2 = 1, which 
is consistent with c = 0, and we summarise our findings: 

cos 29 = 0; sin 26 = + 1. 


These are satisfied by 9 = 45° (and by 135°, as well as other values of 9 
which need not concern us). Thus we have verified that a rotation through 
180°, followed by a rotation through 90° about an axis inclined to the first at 
45°, may be together equivalent to a single rotation through 120°. 

We may go further and use matrices to locate the axis of this single rotation. 
This is not difficult, for points on the axis of rotation are unchanged, or 

/ x\ 


invariant , under the rotation. Thus if x = I y ] is such a point, we have 


Cx = x. 


( 0 1 O' 

0 0 1 

1 0 0 


(taking sin 29 = T 1), 


and so 


f 





and such a point therefore satisfies the equations x = y = z. Clearly the 
axis of rotation is a line making equal angles with the coordinate axes, and 
in fact passes through the point (1, 1, 1). Note that the matrix C is a 
permutation matrix, its effect on the coordinates of any point being to 
‘move them round one place’ — a cyclic permutation — whereby x is 
replaced by y , y by z, and z by *. Evidently this is geometrically a rotation 
through 120°. 

If your experiments with matches and string failed, you may now care to 
demonstrate the geometry of the above by using a cube as shown overleaf: 

The diagram should be self explanatory, and you may of course perform 
the experiment with an ordinary die. 
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9 



01 

3 

<N 


Net of cube 


1 




Exercise 13.8 

1 Find the condition on e for three particles of equal masses moving in a 
line as in example 3 to collide more than three times. Is there any limit to 
the number of collisions that are possible when e is small? 

2 When a rotation through 2n/p about one axis through O is followed by a 
second rotation through 2ii/q about another axis through O, the effect is 
that of a single rotation through 27i/r. In example 5 we saw that if/> = 2, 
q = 4, it is possible for r to have the value 3. Use matrices to find other 
possible sets of values of p and q for which r is rational. 


Miscellaneous Problems 13 

1 O xyz make a set of mutually perpendicular right-handed axes in three- 
dimensional space. The transformation Q l is a quarter-turn about the 
ray Ov, its sense being clockwise if one looks out from the origin along the 
ray; and Q 2 is a quarter-turn about a ray from (0, 0, 1) parallel to O y, 
clockwise if one looks out from (0, 0, 1) along the ray. Find the images of a 
general point (x, z/, z) under Q { an d under Q 2 , and hence find the image 
of (*, y, z) under Q 2 Q l (i.e., & followed by Q 2 ). Prove that the trans¬ 
formation g 2 q. has no fixed point. 

Give the equations of four planes with the property that M 4 M 3 M 2 M 1 = 
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s 2 a., where M t denotes the reflection in the ith plane. Explain why 
aa cannot be expressed as the product of fewer than four reflections. 

(s.M.P.) 

2 Square matrices A , B of the same order have the property that A T A = /, 
B t B = I (where A T , B T denote the transposes of A, B , and I is the unit 
matrix). Prove that: 

(i) If C = A \ then C T C = /; («) IfZ> = AB , then Z> r Z) = /. 

What can you deduce about the set of matrices of the given order such that 
X T X = /? (s.m.p.) 

3 A is a non-singular 3 x 3 matrix; A T denotes the transpose and A ~ 1 is 
the inverse. Prove that: 

(. AB) t = B T A T and that ( A 7 )' 1 = {A~ l ) T . 

Denote by A * the matrix ( A T ) 1 . Operations e, r, 5, t are defined on the 

set of all non-singular 3 x 3 matrices by 

e(A) = A, r(A) = A*, s(A) = A~\ t{A) = A T . 

If tr denotes the composition ‘first r, then t\ prove that tr — justify the 
other entries in the composition table, and complete the table. 



t t s r e (m.e.i.) 


4 (A statistical application of the transpose of a matrix.) 

Suppose we have two variables x, y, and a number of observed pairs of 
values have been recorded: (x i ,y i ),i = 1, 2, . . ., n. It is supposed theoreti¬ 
cally that the points ought to lie on a straight line, whose equations we will 
take to be y = mx + c. We cannot of course satisfy all the n equations, 



for the two unknowns m and c. However the best values of m and c according 
to the ‘ principle of least squares 5 may be found by pre-multiplying by the 
transpose of the n x 2 matrix, thus: 



470 MATRICES, DETERMINANTS, AND LINEAR ALGEBRA 

This gives 



Proceed to show that the values of m and c determined by this equation are 
those which make the line pass through the mean centre of the n points, 
and have gradient m, where 



(see gradient of regression line of y on x, p. 354). 

Find the equation of the regression line of y on x through the five points 

(-3,-1), (-2, 3), (2,2), (5, 6 ), ( 8 , 5). 

5 Solve for x , y, z the equations: 

x + 2 y — z = 4 

3v — 4y + 5z = — 2 

5 x — 5 y + Iz = — 1 . 

Describe geometrically the intersection of the three planes given by these 
equations. (s.m.p.) 

6 Write a short essay on the solution of three simultaneous linear equations 
using the method of pivotal condensation. Include in your essay: 

( i ) at least one example to illustrate the method, 

(ii) a brief description of the nature of‘ill-conditioning’. (m.e.i.) 

7 What condition must be satisfied by the constants a , b, c, /, m, and n in 
order for the system of equations 

— ny + mz = a 

nx — Iz = b 

— mx + ly = c 

to have solutions? 

If this condition is satisfied, show that any solution of the equations will 
then also satisfy the additional equation ax + by + cz = 0, and write 
down the most general solution in this case. 

Do any of these solutions also satisfy the equation lx + my + nz = 0 ? 

8 A 4 x 4 determinant contains 24 terms. When there is a single zero 
among its 16 elements, there will be 24 — 6 = 18 non-zero terms. Find 
the greatest and the least possible number of non-zero terms when three 
zero elements are present, showing where they may be placed. 
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9 If n is a positive integer, show that b — c,c — a, a — b are factors of the 
determinant 

1 1 1 

a b c 

a n + 2 b n +2 2 


and that the remaining factor is the sum of all products of a , />, c of degree n 
(with repetitions of 0, b , c included). 

Obtain the corresponding result when the last row of the determinant is 
replaced by a~ n , b~ n , c~ n . (o.c.) 


10 Given that 


a 2 + 1 
a 3 x 2 + 1 
1 x 2 + a 2 


1 

1 

1 


= 0, 


show that x = 1, or x = a, or x = —a!{a 4- 1) will satisfy the equation. 
Are there any other roots? 


11 Prove that the determinant A_ of order n 


2 cos 0 1 0 0 

1 2 cos 6 1 0 

0 1 2 cos 6 1 


0 

0 

0 


. ... 1 2 cos 9 1 

0 1 2 cos 6 


satisfies the recurrence relation A n+ x = 2 A„ cos 6 — A n _ j. 

sin (n + 1)0 


Prove that, if 0 7^ kn, where k is an integer, then A n = 


sin 0 


If the term in the top left-hand corner is changed to cos 0 , what is the value 
of the determinant? (o.c.) 


12 Expand the determinants 


a b 
b a 
0 b 


0 

b 

a 


a b 0 0 
b a b 0 
0 b a b 
0 0 b a 


and continue with similar determinants of order 5 , 6, ..., etc. 
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Show that the determinant 


110 0 0 
1110 0 
0 1110 
0 0 111 


must have the value 0, 1 or — 1. 



*4 


Introduction to algebraic 
structures 


14. i The idea of a structure 

In 9.5, we proved that, if A, B, C, D are four points in space such that 
AB _L CD and AC _L BD, then AD _L BC. This was done by using the 
scalar product of vectors, and during the course of the proof we made use 
of the distributive property of scalar product, 

a. (6 - c) = a . b - a.c , 

whereby vectors behave like ordinary numbers in the relationship 
a(b — c) = ab — ac. 

The property of distributivity of scalar product over subtraction was not 
something we could take for granted, but had to be established, and it was 
essential that our dependence on the result should be acknowledged in the 
course of the proof concerning the orthocentric tetrahedron. Once dis¬ 
tributivity has been proved, however, we have available this powerful 
vector tool, enabling us to produce a very concise and elegant proof. 

This draws attention to the importance of what has come to be called 
structure , which (roughly speaking) refers to the various rules which the 
elements of a set — the set of three-dimensional vectors in this case — 
have to obey in order that the system may behave itself in a manner which 
avoids inconsistency or contradiction. We know of two methods of 4 multi¬ 
plying’ vectors — the scalar, and the vector, products.I These are good 
definitions precisely because they lead to an orderly structure. One might 
try to invent other ways of ‘multiplying’ vectors; e.g., AB x CD = the 
product of the lengths AD and BC. Such a definition will lead to chaos! 

t Note the structure of grammar here: ‘ the scalar and the vector products ’ is short for ‘ the 
scalar product and the vector product’, so that the word ‘products’ distributes over ‘and’ in 
this case. Again, in ‘W. D. and H. O. Wills’, the surname distributes over the conjunction 
‘and’; so too in ‘Mr and Mrs Wills’. 
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For a start, if AB and A'B ' were ‘equivalent’ vectors (having equal length 
and the same direction), the ‘product’ A'B' x CD would be different 
from the ‘ product ’ AB x CD , when we feel we have a right for it to be 
the same. In the second place, AB x DC would be AC x BD when one 
would rather expect it to give the negative of AB x CD. It is true that 
AB x CD = CD x AB, i.e., our ‘product’ would be commutative. But 
this is by no means essential, since such a useful definition as vector 
product (see 11.5) is in fact not commutative. Much more important, we 
should no longer be able to claim distributivity of x over + and —, for 
AB x ( CD + DE) = AB x CE = AE x BC ; whereas AB x CD + 
AB x DE = AD x BC + AE x BD, and these are by no means equal. 

The simpler features of structure such as commutativity and associativity 
may appear to be of little interest — the smallest child can quickly learn 
to appreciate that 8 + 3 = 3 + 8 (i.e., that addition is commutative ), that 
2 x (3 x 5) = (2 x 3) x 5 (i.e., that multiplication is associative) and 
so on, and the realisation of these fundamental laws quickly becomes a 
matter of habit. Even so, the question of commutativity is not trivial and 
loses its aura of inevitability when one considers non-commutative opera¬ 
tions, such as subtraction and division of numbers and the multiplication 
of matrices. In the case of subtraction, the small child may become con¬ 
fused if asked ‘By how much does 253 fall short of 527?’ and commonly 
enquires about which number to put at the top. 

The idea of distributivity, involving both the operations of + and x , is a 
little more complicated. A young child may appreciate it when dealing with 
a problem like, ‘ How much profit is made by a man who buys 50 articles 
at 28p each and sells them at 40p each?: 

Profit = 50 x 40 - 50 x 28 = 50(40 - 28) = 50 x 12 = 600p = £6, 

and repeated practice in this sort of situation eventually imprints on his 
mind the idea of distributivity of multiplication over subtraction as being 
normal and inevitable. It is possible that he may in some other situation 
make a false assumption about distributivity, as for example if he says 
(, ab)c = ac x be (making the assumption that multiplication distributes 
over multiplication); or that {a + b ) 2 = a 2 + b 2 , which wrongly presumes 
that ‘squaring’ distributes over addition. Indeed a very common error is 
what may humorously be called the ‘universal law of distributivity’, 
c ° (a + b) = c°a + c°b, where ° is some operation; e.g., ‘exponentia¬ 
tion’: 2 x+y = 2 X 4- 2 y (!) or‘sine’: sin (x + y) = sin x + sin y (!), heresies 
not unknown in school work. One of the virtues of studying structure is, 
hopefully, that this type of‘structural’ error is less likely to be perpetrated. 

It is interesting to trace the structural implications in a simple arith¬ 
metical calculation, such as the multiplication 24 x 3: 

For the formal layout 24 
x 3 


72 
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is essentially an abbreviation of 
24 x 3 = (2x10 + 4) x 3 
= (2 x 10) x 3 + 4x3 
= (2x 10) x3 + 12 
= (2xl0)x3 + (1x10 + 2) 

= 2x (10x3) + (1 x 10 + 2) 

= 2 x (3 x 10) + (lx 10 + 2) 

= (2 x 3) x 10 ,-f (1x10 + 2 ) 

= 6x 10 + (1 x 10 + 2) 

= (6x 10 + 1 x 10) + 2 
= (6+ 1) x 10 + 2 
= 7x 10 + 2 
= 72 . 

You should check in this analysis the stages at which you have used the 
associative, commutative and distributive laws of addition and multi¬ 
plication, and then be grateful that we do not usually need to take such a 
scrupulous view of elementary arithmetic! 

Instead of building up our idea of structure from primitive beginnings, 
let us now approach it from the starting point of a very highly organised 
structure — the field. The most familiar example is perhaps the set of 
rational numbers, Q , with the operations’)* of addition ( + ) and multiplica¬ 
tion ( x), obeying the following basic requirements, which constitute the 
axioms for a field, (where it is to be understood that x, y, z denote any choice 
of elements from our set). 


A1 

(* + y) + 2 = x 4- (y + z) 

Associative law of addition 

A2 

x + y = y + x 

Commutative law of addition 

A3 

x-|-0 = x = 0 + x 

Existence of identity element 
(zero) for addition 

A4 

x -1- (— x) = 0 

Existence of inverse under 
addition (the ‘negative’ of*) 

Ml 

V 

X 

X 

II 

N 

X 

IS 

X 

Associative law of multiplication 

M2 

X 

II 

X 

Commutative law of 
multiplication 

M3 

* 

X 

II 

II 

X 

Existence of identity element 
(unit) for multiplication 

M4 

x x x 1 = l (x ^ 0) 

Existence of inverse under 
multiplication (the 4 reciprocal 5 
of x) 

D 

*x (y + z)=xxy + xxz 

Distributive law 


f i.e., closed operations (see p. 478). 
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It will be seen that the first four rules (A1-A4) are concerned with addition, 
M1-M4 with multiplication; D is the distributive law of multiplication over 
addition (to which we have chiefly referred) and which relates addition 
with multiplication. These laws govern the behaviour of the rationals 
under the ordinary manipulations of algebra: subtraction is taken care of 
by the application of Al, A2, A3 and A4. For if we ask ‘What must be 
added to x to obtain y?\ we can easily show that 


x + 2 = y 


— x+(x + z) = 
{-X + x) + 2 = 

0 4 z = 
z = 


— x + y 

— x + y 
-x + y 

— x + y = y + {—x). 


In the same way, if x x z = y, we may show that z = x~ l y = yx~\ 
and this is a formal way of writing the division y -y x. We may loosely 
describe a field as a number system in which addition, subtraction, multi¬ 
plication and division may all be carried out, with the exception of division 
by zero. Of course, it may be that some of the laws for a field as given 
above may be redundant: it is possible, for instance, to dispense with the 
second half of A3 and derive it from the first half of A2. When there are no 
such redundancies, we should have what may be called a minimum set of 
axioms. 

The above was chosen as an example, first because the field is one of 
the fullest and most satisfying structures, and also because the rationals are 
so familiar. You will readily check that the reals R form a larger field (of 
which the rationals Q are a sub-field), and the complex numbers C again 
form a field which contains the reals as a sub-field. Do not imagine, how¬ 
ever, that fields are necessarily infinite: it is common to have finite fields, 
and you may care to show that the two numbers {0, 1} form a field when 
addition and multiplication are defined, modulo 2, thus: 


+ 

0 

1 

X 

0 

1 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 


Clearly, however, the integers Z do not form a field, since multiplicative 
inverses do not exist for numbers other than + 1 or — 1. So M4 is not satis¬ 
fied and the integers with the operations + and x form a 4 lower ’ structure, 
known as a ring. By contrast, there do exist even more highly organised 
structures than fields, e.g., vector spaces and modules , but these need not 
concern us here, and we shall concentrate upon the most important of all 
mathematical structures, known as a group. So central is this that its study 
forms the backbone of the present chapter, but first we must discuss more 
fully the idea of a binary operation. 
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Exercise 14.1 

1 Consider the long multiplication 

2 7 5 
7 9 

2 4 7 5 
19 2 5 

2 17 2 5 


and analyse the uses of the field axioms in the course of the work in the 
same way as was done in the case 24 x 3 = 72 above. 

2 Revise the proofs of a. (b + c) = a.b + a.c 

and a X (b + c) = a X b + a X c, 

where a, 6, c are vectors in three dimensions. 

3 Show that {0, 1, 2, 3, 4} is a field under addition and multiplication 
modulo 5, zero being excluded in the case of multiplication, but that 
{0, 1, 2, 3, 4, 5} cannot be made into a field with addition and multiplica¬ 
tion modulo 6, even when zero is excluded. 

14.2 Binary operations 

In the set of axioms for a field given in 14.1, we were dealing with a familiar 
example in which x, y , z represented numbers. One of the deficiencies of 
traditional school mathematics was the impression given that letters always 
stood for numbers. But here you have already encountered many examples 
of the use of letters for other entities — sets, matrices, rotations, points, 
vectors, permutations, lines, curves, ordered pairs, and so on. We may 
now ask whether sets of such objects may be organised into structures. 

That the answer to this question must be 4 yes 5 you cannot have doubted. 
But whatever the nature of the elements of the set which is being provided 
with a structure, the fundamental step to such provision is the idea of a 
binary operation or law of composition , by which we can take any two elements 
of the set and combine them in some way, according to some rule. Thus, for 
example, multiplication is a binary operation on the set of integers, for 
when one takes two integers such as —3 and 2 and combines them by this 
particular binary operation, one obtains the integer —6, which is also in Z, 
and we write ( — 3) x ( + 2) = — 6, the binary operation of multiplication 
being represented by the familiar symbol x . More generally, if the binary 
operation is represented by *, we may describe the result of combining 
members x and y of the background set S formally as follows: 

z = x * y 

and this states that the result of applying the binary operation * to the two 
elements x and y of S is to produce the element z. If (as in the multiplication 
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of integers) the result of an operation is always in the original set S, then 
we say that our operation is closed ; but if there is some z which is not in S, 
then the operation is not closed, as example, S = Z (the integers) and * is 
the operation of division. Then ( —3) * ( + 2) = — 1^. But — 1^ is not in 
the set Z, so the operation -r- on the integers is not closed. Again, suppose ** 
denotes ‘exponentiation 5 (as in the computer programming language 
Fortran), then ( — 3) ** ( + 2) = ( — 3) 2 = +9. However, ( + 2) ** ( — 3) = 

2 “ 3 = \, and this operation would be closed if our set had been the reals R , 
but is not closed when we confine our attention to the integers Z. (Note, in 
this case, that we must be careful to exclude the case x — y — 0, for no 
value can be assigned to 0°). 

One may also describe a closed binary operation on a set S as a mapping 
from ordered pairs of the set into members of the set itself: (#, y) i->* z 
where x,y, z e S. If however, the binary operation is not closed, the image 
may be outside the set, as for example in the case of the operation . on the 
set of vectors, which is a mapping from the ordered pairs of vectors into the 
reals, R. 

It is important to realise that by the use of a binary operation, an element 
may be combined with itself \ and that x * x must be defined for all x in S : 
e.g., if S = R and * is multiplication, then x * x = x 2 ; again, if S = Z + 
and * is the operation of finding the h.c.f. of x and y , then in the case 
x = we have x * x = x for all x in Z + (since the h.c.f of a positive 
integer and itself is that same number). Moreover, if z = x * y, it is per¬ 
fectly possible for z to be equal to x or y. For, if ** is exponentiation, 

3 ** 1 = 3, and 1 ** 3 = 1. 


Linear transformations 

We now consider an example where the elements of the set are not 
numbers but two-dimensional linear transformations, such as rotations, 
translations, reflections, shears, or enlargements. Given two elements in 
the set of all such transformations, how do we define a binary operation ° on 
them? Simply by considering the composition of the movements, one after 
the other. So if 7\ and T 2 are two transformations, we shall denote the 
result of first performing the transformation T l , and then performing T 2 
by T 2 ° Tj, the reason for this order being our similar practice when 
linear transformations were represented by matrices (see 13.2). Suppose 
now that R x represents a +90° turn about O t , and R 2 represents a +60° 
turn about point 0 2 in the plane of the rotations. Then in the case O t = 0 2 , 
R 2 °R x evidently represents a +150° turn; and also, clearly R 2 oR^ = 
R x o R 2 . In the case when Oj and 0 2 are distinct, intuition tells us that 
R 2 ° R x is a + 150° turn, but it is not immediately clear about which point 
the rotation is centred. A little experiment should however convince you 
that R 2 ° R x ^ R x © R 2 . 

Again, suppose our set of transformations consists of all possible rotations 
in the plane, through any possible angle about all possible points. In view 
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of the above informal discussion, it may seem that this set is closed under the 
operation °. However, this is not so, and it is not difficult to produce a 
counter-example: consider a positive 90° turn about A followed by a 
negative 90° turn about a different point B : 


Y 



It is clear from the figure that the composition of the two rotations is not a 
rotation, but a translation 7", for after rotation through two equal and 
opposite angles, the orientation of the figure is restored —- the sort of 
manoeuvre might be executed in the course of a dance, at the end of which 
the dancer would be facing the same way. Evidently, the composition of 
rotations about different points through equal and opposite angles will 
always be a translation, so that the set of plane rotations is not closed under 
composition. Closure may be achieved, however, by adding the plane 
translations to give a set of transformations comprising both rotations and 
translations. Notice, however, that the subset of translations alone is closed 
within itself. 



As another illustration, consider the half-turns (180° rotations) of a 
cuboid about its three perpendicular central axes, and let us call these 
X , F, and Z. Here is a set containing only three objects, and we consider 
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whether we have closure. A simple experiment with a matchbox will con¬ 
vince you that 1° 7= FqI= Z, with four other similar relations. But 
our hopes of closure are dashed when we remember that X ° X is neither 
X nor Y nor Z, but has the effect of restoring the original position of the 
box. This is the identity transformation , which may be called /, and it must be 
included in our set if we are to have a closed system. The table below shows 
the way in which our four transformations combine: 


0 

I 

X 

Y 

z 

/ 

I 

X 

Y 

z 

X 

X 

I 

Z 

Y 

Y 

Y 

z 

I 

X 

Z 

Z 

Y 

X 

I 


Permutations 

As another illustration of a set with a binary operation, we consider the 
composition of permutations, a most important idea in mathematics. 
A finite permutation is strictly a mapping from a finite set on to itself, such 
as 




Here there are five elements A, B, C, D, E, and the arrows indicate that the 
element A is ‘mapped’ into Z), D is mapped into E, and E into A , while 
B and C are interchanged. It is perhaps convenient to think of a permutation 


as a ‘rearrangement’, and to represent it as 


A B C D E\ 
D C B E AJ 


. Calling this 


permutation p , suppose now we wish to combine it with another permuta¬ 


tion q which replaces the letters according to the scheme 


A B C D E\ 
C E B D AJ 


Consider the effect of first performing the permutation p , and then following 
this by the permutation q. The effect will be that A will be replaced first 
by Z), and finally by D\ B will be replaced by C, and subsequently by B , 
and so on. The final effect of the composite permutation arising from the 


successive application of first p then q will be thus: 


'A B C D E\ 
D B E A Cj 


. Calling 
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this permutation r, and adhering to the rule of writing the first operation 
on the right, we may write 

(A B C D E\ 

r ~ q ° P ~\DB EA c) 

where ° represents the binary operation of composition of the permutations. 
You should verify that 

(A B C D E\ 

S ~ P ° q ~\B A C E Dy 

showing that composition of permutations is not commutative. 


Composition of functions 

As another example, we may consider types of binary operations on the 
set of functions. 


If /(*) = 3 - x and g(x) = - - 

l — X 


are two functions, then one simple way of combining them is by straight¬ 
forward addition: 


/0) + g[x) = 3 - X + -- 

2 — x 


6 — 4x + x ^ 
2 - x 


this being denoted [f + g) (*). 

Again, if f (x) — In x and g(x) = sin x, the function f H- g would be 
In x + sin x. However, we are perhaps more familiar with the composition 
of functions by successive substitution, (f ° g) (x) being defined as^Q^x)] 
so that in the latter example above we get the familiar In (sin x), whilst 
in the former case, 


l f°g)(x) 



6 — 4x 
2 — x 


Note that this operation o is not commutative, for 


(g °f) ( X) = 


3 — x 


3 - 


2 - (3 - x) 




Exercise 14.2a 

1 Consider the set of permutations of five letters which interchange two 
pairs, such as (AB) (CD), which changes ABCDE into BADCE. Is this set 
closed under composition of permutations? 
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2 Is the following a valid binary operation on the set of rationals? 

a c a + c 

7 * - = -- - where a , b, c, de Z. 

b a b + a 

Is the following a valid binary operation on the set of points in a plane? 
Given P, Q, P * Q is defined as the mirror image of P in OQ, where O is 
a given fixed point. 

3 Find which permutation (s) must be added to the following set in order 
for the set to be closed under composition of permutations: 

1 2 3 4 5 6 

4 1 2 3 5 6 

3 4 1 2 6 5 

2 3 4 1 6 5 

4 Show that the set of 3 x 3 matrices which have positive determinants 
are closed under matrix multiplication. Are they closed under matrix 
addition? Consider also the set of matrices whose determinants are ( i) nega¬ 
tive, (ii) zero. 

5 Consider the set of 120° rotations of the regular tetrahedron about its 
altitudes. Is this closed under successive composition? 

6 Is the set {1, 3, 5, 9, 13} closed under multiplication mod 14? Is there 
a subset which is closed? 

7 Consider the closure or otherwise of the following number systems 

under the operations +, —, x , : 

(0 N; (ii) Z; (iii) Z~ ; ( iv) Q; (v) Q + ; (vi) R; (vii) R + ; 

(■ viii ) C ; (ix) the irrationals; (*) cos 9 + i sin 6 (0 < 6 < 2 n); 

(xi) x + iy (x,yeZ). 

8 Is the set of 2 x 2 singular matrices closed under matrix addition, 
under matrix multiplication? 

9 In the following cases, find (f + g),f°g,g °/,/ 2 and g 2 : 

(*) /(*) = g(x) = 7 -^—; («) /(*) = In*, g(x) = sinx; 

x — 3 

(iii) f(x) = — - g(x) = -x; 

(iv) f(x)= 2 x—l, g(x) = 5 — 2 x. 

10 Is the operation x * y = x y closed in ( i) Z + , (ii) Z; (iii) Q; (iv) R + ? 

11 Show that functions of the type f(x) = (ax + b)/(cx + d) are closed 
under function composition. 
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12 Construct a finite set of 2 X 2 matrices which is closed under matrix 
multiplication. 

13 Consider the operations on vectors: (z) vector addition; (ii) scalar 
product; (Hi) vector product. Which of these are closed and which not 
closed (a) in two-dimensional space; ( b ) in three-dimensional space? 


S denotes the set of matrices of the form 


where x and y 


are real. Investigate whether, for the operation of matrix multiplication, 
(2) S is closed; (ii) the identity belongs to S ; (Hi) every element of S 
has an inverse. (s.m.p.) 


Associativity and commutativity 

Once we have a set with a closed binary operation (or possibly more than 
one operation), we have what may be called an algebra. If the existence of 
the operation is the only structural feature, then the algebra is very primi¬ 
tive. But usually the operation satisfies a number of axioms which make the 
structure more interesting. 

Generally speaking, the most desirable property, apart from closure, of a 
binary operation is that it should be associative , that is 

Va, b, ce S, (a * b) * c = a * (b * c). 

If this feature is lacking, the structure is unlikely to lead to fruitful mathe¬ 
matical results (though the operation of vector product is a notable excep¬ 
tion). On the other hand, commutativity , 

V 0 , b 6 S, a * b = b * a 

can often be dispensed with, and many examples of non-commutative 
operations which are nonetheless mathematically powerful have already 
been encountered. 

It is usually a simple matter to see whether an operation is commutative 
by inspection, but often a laborious process to check for associativity. For 
example, consider the binary operation on R defined by the formula 

v x + v 2 .,11 II 

V x * v 2 = - -’ with k/J < 1, \v 2 \ < 1 

1 4- v x v 2 

(a formula used in special relativity theory). We wish to show that * is 
associative, i.e., that 

(v 1 * v 2 ) * v 3 = v x * (v 2 * v 3 ). 

v x + v 2 

, TT - ^3 

v x + v 2 1 + V x v 2 

- * V 3 = - 

1 + VyV 2 J K + t>2>3 

1 + v x v 2 

v x + v 2 4- v 3 -h v x v 2 v 5 

1 + 12^2 + ^ 1^3 + V 2 V 3 


Now (v x * v 2 ) * v 3 
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and 


V\ * (v 2 * v 3 ) = v 1 * 


^2 + ^3 
1 “h ^'2^3 


1 + v 2 v 3 

J + «l(«2 + H) 

1 + v 2 v 3 

v 2 -\- v 3 V^V 2 V 3 

1 1 ^4^2 4 ~ -j - ^2^3 


so the operation is associative, as well as being obviously commutative. 
On the other hand, the operation x ~ y = \x — y\ is commutative but 
not associative, as may be confirmed by a single counter-example: e.g., if 


x = 2, y = 1, z = 4, 

then (x ~ y) ^ z — \2 — 1 ] ~ 4 = 1 ~ 4 = |4 — 11 = 3, 
but ^ - (y ~ z) = 2 - |1 - 4| = 2 - 3 = |2 - 3| = 1. 

Usually there is no short cut to discovering whether or not a particular 
operation is associative and it cannot be ascertained by inspection. How¬ 
ever, one should remember that associativity is always guaranteed in the 
following cases: 

(i) multiplication and addition in a number system, including finite 
arithmetics (see 14.3); 

(: ii ) composition of mappings (geometrical transformations, permutations, 
matrices, vectors, etc.). 


Exercise 14.2b 

1 State whether each of the operations in exercises 14 . 2 a, nos. 2 , 10 are: 
(a) commutative; (b) associative. 


2 Discover which of the operations on the reals described by the following 
formulae are (a) commutative (b) associative: 

/•x x y — 1 1 — xy 

( 1 ) x * y = - x * y = - [x + y # 0); 

x + y x + y 

(ii) x * y — k(x 4- y ); (Hi) x * y = \x — y \; ( iv) x * y = x; 

x + y 


(v) X * y = 


xy 


(x, y 7^ 0); (vi) x * y = x x | y \; 


(vii ) x * y = x y . 


3 Show that the non-commutative operation shown in the table is not 
associative: 


* 

a 

b 

c 

a 

a 

b 

c 

b 

c 

a 

b 

c 

b 

c 

a 
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4 Show that the operation * on the set {a, b, c, d, e} where products are 
given in the following table, is not associative: 


* 

a 

b 

c 

d 

e 

a 

a 

b 

c 

d 

e 

b 

b 

c 

e 

a 

d 

c 

c 

a 

d 

e 

b 

d 

d 

e 

a 

b 

c 

e 

e 

d 

b 

c 

a 


5 Show that, for vectors in three-dimensional space, vector addition is 
associative, but that vector product is not. 

6 Prove the associativity of the following law of composition for pairs of 
reals: 

(*1 ,y 1) * (*2> Vi) = (*1*2 - y2 > x iV2 + x 2yi)- 

7 Show that the operation on ordered pairs of reals defined by 

(*1 ,yi)*(x 2 , y 2 ) = ( x i x 2 , x 2 y 1 + y 2 ) 

is associative, and find whether the operation on ordered triples defined by 
(x l9 y l9 z x ) * {x 2 ,y 2 ,z 2 ) = (x x + x 2 + y 2 z l9 y 1 + y 2 ,z j + z 2 ) 
is or is not associative. 

8 Show that the operation on R expressed by the formula 

xy + k 

x*y = —— {x + y # 0 
x + y 

is associative for all and suggest some interpretations for special values of k. 

9 The operation o is defined on the number pairs A = (a 1 ,a 2 ) B = (b 1 ,b 2 ) 
so that A o B = (a l + b 2 , a 2 + b x ) and A = B if and only if = b x and 
a 2 — b 2 . Find whether the operation is associative. 

Find P such that A ° P = A, and Q such that Q ° A = A, and determine 
whether /, J can be found such that, for all A, A ° / = A and J ° A = A. 
Given that T* = ( — a 2 , —a 1 ) 9 discuss the following: 

Bo A = Co A 

=> B o A o A* = C o A o A * 

=> 5o(0,0) = Co (0,0) 

=> B = C. (m.e.i.) 


Identity and inverse 

Before we are ready to talk about groups, we must consider two other ideas, 
those of identity and inverse. If a set S is provided with an operation * and 
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there exists a fixed element e of S which combines with any element of S to 
leave it unchanged, i.e., 


'dx e S, x * e = x 
(right 
identity) 


and 


e * x = x 

(left 

identity), 


then e is described as an identity element for the set. For example, when 
the operation is addition in the set of integers, the identity is 0; while for 
multiplication, it is 1. These are the two numbers, 0 and 1, which play 
special parts in the axioms for a field (see 14.1). For addition in the set of 


2x2 matrices, the identity is 


0 0 
0 0 


the ‘null 5 matrix; for multi¬ 


plication it is 


the ‘unit 5 matrix. Note that 



is a 


left identity for the set of all 3 x n matrices, but a right identity for the set 
of all n x 3 matrices, and both a left and right identity only for square 
3 x 3 matrices. Similarly, we have already encountered the identity trans¬ 
formation in a set of transformations, the identity permutation and the identity 
function . In the case of subtraction on the set Z (or Q or R), we have a right 
identity x — 0 = x (for all x), but no left identity; similarly there is a right- 
but no left-identity in the case of division and also exponentiation. When we 
speak of ‘the identity element’ in future, we shall be concerned with 
associative closed operations having an identity which is both a left- and 
right-identity. 

The concept of inverse is one that has already arisen in several situations 
during the course of this book. The general idea is that of 4 undoing ’ an 
operation, i.e., of finding the element y which combines with x to give the 
identity, thus 


x * y = e 

(y is a right inverse) 


y * x = e 

(y is a left inverse). 


(the identity being 0) we 


For example, in the set Z with the operation 
have 

x + ( — *) = 0 and ( —*) + x = 0, 

so that —x, or the negative of x, is the inverse of x under addition. In the 
set Q with the operation x, (the identity being 1 ) we have x x l/x = 1 
(x =£ 0), and the multiplicative inverse is the reciprocal, and written x~ l . 
We have considered the inverse of a matrix under matrix multiplication in 
some detail, and we use the same notation, M ~to denote the inverse of 
a non-singular matrix M , satisfying MM ~ 1 = I = M~ l M (see 13.4). 

A similar notation is used in the case of inverses of geometrical transfor¬ 
mations, permutations and functions. For example, iff? is an anti-clockwise 
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rotation through 40° about O, then the inverse of this, a clockwise 40° 
rotation about 0, would be denoted R~ \ Again, if x is the permutation 

/I 2 3 4 5 6\ 

\5 1 6 4 2 3/ 


then the inverse permutation may be written 


/5 1 6 4 2 3\ 

or more simply 

VI 2 3 4 5 6) py 



3 4 5 
6 4 1 


and is denoted x~ l . 

You should check that the result of combining x with x' _1 either way 
round is to produce the identity permutation. 

Lastly, we have also considered (see i. i, Book i, Advanced Mathematics) 
the inverse of a function or mapping. Taking a particular example, 


if f:x 


X + 1 

2 — x 


, we wish to specify the inverse f 1 (*). 


pi, x + 1 

Calling y = -- 

2 — x 

=> 2y — xy = x 4- 1 
2 z/+l 


so the required inverse function is f 1 (x) 


2x + 1 
x + 1 


You should check that the ‘composition’ of the functions / and f~ l , in 
either order, does in fact give the identity function i:x \-+ x. 

Finally we notice that if x and y are two elements of a set which combine 
under an associative binary operation *, then the inverse ofx *z/isy _1 *x _1 ; 
this may be abbreviated by dropping the * and using the ‘juxtaposition or 
multiplicative notation of ordinary algebra: 

{xy )' 1 = y - 1 * -1 . 

The proof is straightforward: 

{xy) {y' l x~ l ) = x(y {y “'aT 1 )) = x^yy'^x' 1 ) = * {ex' 1 ) = xx ' 1 = e. 
(Note the repeated use of associativity.) 

Similarly, {y~ 1 x~ 1 ) (xy) = e, so that xy and y~ l x~ l are inverse elements 
(just as the inverse of the combined operation of putting on one’s socks and 
then one’s shoes is the operation of first removing the shoes, then removing 
the socks). Evidently the rule may be extended to 

(xyz) ~ 1 = z~ l y~ l x~\ etc. 




4 B 8 INTRODUCTION TO ALGEBRAIC STRUCTURES 

Exercise 14.2c 

1 Find the identity elements for sets under the operations n and under u 
(see 0.2 and 14 . 6 ), and show that it is impossible to find inverses for these 
operations. Find an operation which does admit inverses. 

2 (z) What must be included in the set (4, 8 } in order to achieve closure 
under multiplication mod 14. What is the identity element, and what is the 
inverse of 4 ? 

(ii) Which is the identity element in the system (2, 4, 8 , 10, 14, 16} under 
multiplication mod 18? 

(Hi) Show that (2, 4, 6 , 8 , 12, 14, 16, 18} is closed under multiplication 
mod 20 , but has no identity. 

3 Find: 

(z) the inverse of 27 and of 12 under addition mod 40 of (0, 1,2, . . .,39}; 
(ii) the inverse of 17 under multiplication mod 20 of (0, 1,2, . . ., 19}; 

(Hi) the inverse of 53 under multiplication mod 100 of (0, 1,2, . . .,99}. 


4 Find identity elements where they exist in operations described in 
14 . 2 b, 2, 3, 4, 5, 6, 7, 8, 9. 


5 Find the inverses of the matrices under matrix multiplication, and 
interpret geometrically where possible: 




/cos 80° —sin 80°\ 
ysin 80° cos 80/ 


6 Find the inverses of the following functions: 


(0 1 - 
(») «*; 


(») 

[vi) 


2 - x_ 

5 

X 

3 * - 7 

ox - 3' 


(zzz) 


2 + x 


2- x 
vii) log fl (log^). 


(iv) at; 


7 Find the inverses of the permutations 
1 2 3 4 5\ ... (\ 


(i 


[III 


2 4 5 1 3 

1 2 3 4 5 6 7 

4 7 5 1 3 6 2 


4 6 


3 4 5 6 \ 
5 12 3/ 


Verify in each case that x ° x 1 and x 1 0 x are both the identity permuta¬ 
tion. 
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8 Verify the rule (xy) ~ 1 = y~ l x~ 1 in the case when x and y are the per¬ 
mutations 

(\ 2 3 4 5\ _/l 2 3 4 5\ 

* V* 1 2 3 5j y \3 5 1 4 2/ 

9 Let S be the set of 2 x 2 matrices with elements in {0, 1, 2, 3, 4, 5}. If 

^4 £ S and ^4 has an inverse in S, show that det A = 1 or 5. (j.m.b.) 

10 Show that the set of matrices of the form 


l a 

b 

i \ 

V 

a 

* 

\b 

c 

a! 


are closed under matrix multiplication, and that the identity is in the set. 
Under what circumstances do we have inverses? 

11 Let © and ® be two binary operations (rules of composition) defined 
on the field Q of rational numbers, as follows: 
a ® b — a b — 1 a ® b = a + b — ub. 

Find the identity element, and the inverse of any number a , for each of these 
operations. 

Find all functions f:Q Q which simultaneously satisfy the equations 
f(a + b) =f(a) ®f(b) and f(ab) =f(a) ®f{b). (m.e.i.) 

14.3 Groups 

We are now ready to describe the vitally important structure known as a 
group, but before giving a general definition we shall first take two examples. 

Example 1 

Let us consider the set of integers {1, 3, 7, 9} and their products modulo 10 
(i.e., the remainders, or residues , when their ordinary products are divided 
by the modulus 10 ). 

As 3x7 = 21, 3x9 = 27, 9x7 = 63 etc., we may write 3x7 = 1 , 
3x9 = 7,9x7 = 3 (mod 10), and these products can be summarised: 

Table 1 

x 10 1 3 7 9 

1 13 7 9 

3 3 9 1 7 

7 7 19 3 


9 


9 7 3 1 
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The system has the following features. First, it is closed. Had we taken the 
set {3, 5, 7}, this would have failed for closure, for products such as 3 x 7 
would not have been contained inside the set. Second, the operation of 
multiplication is undoubtedly associative. Third, we have an identity 
element, 1. And fourth, every element has an inverse; e.g. since 3x7=1, 
the residues 3 and 7 are a pair of inverses. (This would not have been so 
had the number 5 been included, for the element 5 would have no inverse 
though closure would still have held good.) A system which passes these 
four tests — of closure , associativity , identity and inverses^ — is called a groups 
and as the present example has four elements, it is said to be a group of 
order 4. 


Example 2 

We now give an example of an infinite group, provided by the set of all 
non-singular 2 x 2 matrices with real elements under the operation of matrix 
multiplication. 

It is immediately clear that 

( i) The product of two such matrices with real elements is another such 
matrix, so closure is established. 

( ii) Multiplication of matrices is known to be associative. 

(Hi) The set includes the unit matrix 

(iv) Every non-singular 2x2 matrix has an inverse which is also in the 
set. 

So the fourfold requirements for a group are satisfied. 

This time we have not only an infinite group, but one in which the binary 
operation is non-commutative, since matrices do not commute under 
multiplication. By contrast, our first example {1, 3, 7, 9}, x (mod 10) 
was a finite commutative group. Such a group, in which every product is com¬ 
mutative is called Abelian , after the Norwegian pioneer of group theory, 
N. H. Abel ( 1802 - 29 ). Note that, within non-Abelian groups, it is possible 
for some products to commute: the identity will always commute with 
every element; and in the group of non singular 2 x 2 matrices under 
multiplication, we note that, 



Having provided these two very dissimilar examples, we are now ready 
to give our general definition: 



| We shall specify a group by stating first the set, then the binary operation, these being 
separated by a comma. 
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A group is a set G with a binary operation * subject to the following 
requirements: 

C * is closed in G, i.e., for all x, y e G, x * y e G 

A * is associative , i.e., for all x, y, z e G, (x * y) * z = x * (y * z ) 

N There is an identity or neutral element e satisfying 
x * e = e * a = x, for all x e G 

I Each element x has an inverse , x~ l , satisfying 
x * x~ 1 = x~ 1 * x = e 


If we now use the symbols V to denote Tor all’, 3 to denote There exists’ 
and : to denote ‘such that’, these four axioms of a group G, * may be con- 
viently summarised: 

C Vx, y e G, x * y e G 
A Vx,y,zeG, (x * y) * z = x * (y * z) 

N 3e e G: Vx e G, x*e = e*x = x 

I Vx e G, 3x"~ T x * x -1 = x _ 1 * x = £ 

If you refer to the axioms for a field (see 14 . 1 ), you will notice that 
A 1-4 and Afl—4 are equivalent to stating that we should have a group 
under addition and under multiplication, always remembering that the 
‘zero’ element (the identity for addition) must be removed in the case of 
the multiplicative group, since it has no multiplicative inverse. Thus the 
rationals, Q , are a group under addition, and also group under multiplica¬ 
tion, provided 0 is excluded (just as all singular matrices had to be ex¬ 
punged from the set before group structure of 2 x 2 matrices could be 
claimed). 


Exercise 14.3a 

1 Show that the following are finite groups and state the order of the 
group in each case 

(i) integers with a specified number of binary digits under the operation 
of addition mod 2 with no carrying (e.g., Oil * 110 = 101); 

x — 1 — 1 1 -j- x 

(ii) the functions x, -> -> - under function composition; 

X + 1 x 1 — X 

{Hi) all the 120 permutations of five letters; 

[iv] the set ( 0 , 1 , 2 , . . ., n — 1 } under addition mod n ; 

( 0 ) the set (1, 2, 4, 7, 8 , 11, 13, 14} under multiplication mod 15; 

(vi) all the symmetries (rotations and reflections) of the rectangle; the 
square; the equilateral triangle, the cuboid, etc. 
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2 Show that the following! are infinite groups 

(i) Z, + ; (it) R , + ; (tit) £*, x ; (iv) Q*, x ; (v) p X q matrices, 

under matrix addition (suggest numbers systems from which the elements 
might be drawn); 

(vi) n x n non-singular matrices under matrix multiplication; 

(vii) all isometries in the plane (i.e., transformations which preserve dis¬ 
tance) under successive composition; 

(viii) vectors in 2D space under vector addition; 

(ix) all integral powers of 3 under multiplication. 

3 Find subsets of the examples in no. i and no. 2 above which are them¬ 
selves groups (called subgroups) under the operation mentioned. 


4 Say why the following fail to be groups: 

( i ) odd integers under addition; 

(ii) integers under multiplication; 

(iii ) sets under the binary operation 4 intersection 5 6 7 ; 

(iv) the negative rationals, under addition; 

(y) {a + by] 2 - 1 - c^/3} (a, b, c € Z) 9 under multiplication; 
(vi) {a + by] 2 + Cyji] (a, b, c e Q), under multiplication; 


(vii) the permutations 


12 3 4 
2 14 3 


12 3 4 
3 4 12 


12 3 4 
4 3 2 1 


(viii) { 1 , — 1 , i) under multiplication; 


(ix) all irrationals together with zero, under addition. 



5 Show that the first table is a group table, but the second is not. In the 
third, fill in the blanks in the group table, given that it is Abelian, and that 
every element (say g) satisfies g 3 = 1 (identity) 

a b c d e f a b c d e Xpqrstuvw 

a e c b f a d a b d e c a 1 Xpqrstuvw 

b d e f a b c b c a d e b p p q X t . . . . . 

c f a d e c b c d e a b c q q X p . 

d b f e c d a d e c b a d r r ... 1 ... . 

e a b c d e f e a b c d e s s . 

f c d a b f e t t . 

u u .... X .. . 

V V . 

W U) . 

6 Show that the set {*, 1 , —x, —1} of linear polynomials in x, wh. 
multiplied modulo (x 2 + 1 ), are a group, and write out the group table 

oi i i /cos 0 — sin 

7 Show that the matrices I . _ ^ are a group under matrix 

\sm 0 cost// 

multiplication. What is the order of the group when 6 = 15°? 
f See Notation (p. xiii). 
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8 The set S consists of all transformations T of the real numbers given by 
T(x) = ax + b (a ^ 0, a, b real). 

Show that if T l9 T 2 e S, then T x T 2 e S. Prove that the set S forms a group 
under composition of functions. Find the elements T of £ of order 2 (i.e., 
TT = / and T # / where I is the identity transformation). Prove that 
these elements are all reflections. Show that there are no elements of order 
3 in 5. 

9 Let G be the set of all matrices of the form ^ with x a real 

number. 

(z) Show that G does not form a group under matrix addition. 

(ii) Show that G does form a group under matrix multiplication. 

(You may assume associativity of matrix addition and multiplication.) 

(J.M.B.) 

10 G is a finite group, and g is an element other than the identity. A set S 
consists of all positive integral powers of g (so that S = {g, g 2 , g 3 , - • - 
Prove that: (i) the identity belongs to S ; (ii) g -1 belongs to S. 

Give an example to show that neither of these need be true if the group is 
infinite. (s.m.p.) 

We now resume the illustration of groups in a variety of contexts. 


Infinite groups 

Other examples of infinite groups include the set of plane rotations and 
translations (see p. 478 ), the law of composition being as described there; in 
three dimensions, the set of transformations (called isometries ), such as 
rotations, translations and reflections, which preserve distance between all 
pairs of points; and also the set of vectors in three-dimensional space under 
vector addition. In each case, it should be confirmed that the four require¬ 
ments for a group are satisfied. 


Finite groups 

Turning to groups with a finite number of elements, we saw at the beginning 
of this section that there is a useful way of displaying all the products of 
elements in the group in the form of a table, and this is valuable in the case 
of small groups. For example, the matrices 



under multiplication combine according to the entries of 
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Table 2 


X 

I 

X 

Y 

z 

/ 

I 

X 

Y 

z 

X 

X 

I 

Z 

7 

Y 

Y 

z 

/ 

X 

Z 

Z 

Y 

X 

I 


and this describes the group completely. It will be recalled that precisely 
the same table arose (p. 480 ) when we combined the half-turns of the rec¬ 
tangular box, so that the table serves to describe the composition of the 
various rotational symmetries of the cuboid. 


By contrast, a quite different table arises when the four matrices 



combine under multiplication: 
Table 3 


X 

I 

R 

S 

H 

I 

1 

R 

s 

H 

R 

R 

H 

I 

S 

S 

S 

I 

H 

R 

H 

H 

S 

R 

I 


These matrices correspond to rotations about the origin through angles 
0, ^ 7 z, 71, §71, and so the table describes the rotational symmetries of a 
figure like 



Permutation groups 

Next we consider a group of permutations. 

Let x be the permutation 
2 by 3, . . ., 5 by 1 . 


(12 3 4 
\2 3 4 5 



i.e., x replaces 1 by 2 , 
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It is usual to denote such a ‘cycle’ by (1 2 3 4 5), each digit in the cycle 
enclosed in brackets being replaced by the succeeding one, the cycle being 
completed by the replacement of the final digit by the first. (So that this 
particular cycle might also be denoted as (4 5 1 2 3), and in various 
other ways). Similarly, the permutation 
/1 2 3 4 5 6 \ 

y V 4 6 5 2 1 v 

can be shown in cycle notation asj = (1 4 2 6 3 5); while the permuta- 
tion 

= /I 2 3 4 5 6 \ 

Z \5 3 6 4 1 2) 

contains two cycles, a = (1 5), and p = (2 3 6 ), the digit 4 being un¬ 
changed by the permutation. We also see that the cycle *=(1 2 3 4 5), 
when applied repeatedly to the five digits, changes the line successively as 
follows: 

1 2 3 4 5 —2 3 4 5 1 3 4 5 1 2 4 5 12 3 

—^ 5 1 2 3 4 1 2 3 4 5, 

so five applications of the permutation * lead to the identity permutation , and 
we write ********* = e. 

From now on we shall usually drop the symbol *, or whatever sign is 
used for the binary operation, and us t juxtaposition to denote the composition 
of two elements: thus * * y will be abbreviated to xy (first y, then *). By 
analogy with algebra, we also use * 5 to abbreviate ***** and, since we are 
now committed to a multiplicative notation, we shall henceforward use 1 for 
the identity element. Thus, in the above, * 5 = 1 describes the behaviour of 
the cycle * = (1 2 3 4 5). 

Evidently, since * 2 = 

we may write * 2 in cycle notation as (1 3 5 2 4), though it is easy enough 
to do this without first writing down the cumbersome replacement notation. 
The permutations 1, *, * 2 , * 3 , * 4 combine according to 

Table 4 


C 5 



(1 2 3 4 5) 
V3 4 5 1 2/ 
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and this particular type of group, generated by a single cycle, is called a 
cyclic group and denoted C 5 . In a similar way, the permutation 


A 2 3 4 5 
\4 6 5 2 1 



4 2 6 3 5) 


generates a cyclic group, this time containing 6 elements, or a group of 
order 6 , called C 6 . More generally, the cyclic group of order n is abbre¬ 
viated C n . 


Finite arithmetic 

Suppose in the case of the permutation x in the previous paragraph we had 
tabulated not 1 , *, x 2 , x 3 , and x 4 , but the 'indices’ (i.e., the number of times 
the permutation x had been applied, namely 0, 1,2, 3, and 4). Then we 
obtain 

Table 5 



0 

1 

2 

3 

4 

0 

0 

1 

2 

3 

4 

1 

1 

2 

3 

4 

0 

2 

2 

3 

4 

0 

1 

3 

3 

4 

0 

1 

2 

4 

4 

0 

1 

2 

3 


For example, when 4 is combined with 3, we get 2; because, in this system, 
x 4 x x 3 = x 7 = x 2 (as * 5 = 1). So far as the indices are concerned, we 
may say 4 + 3 = 7, but since any superfluous 5’s are of no interest, we 
’may work modulo 5, and declare that 4 + 3 = 2 (mod 5). Thus in Table 5 
above, the operation is addition mod 5 and we have an example of a finite 
arithmetic providing us with a group. 

More interesting groups arise in finite arithmetics when the operation 
is multiplication to some modulus: for example, the group of order 4 at the 
beginning of this section. Another simple example is provided by the set 
{0, 1, 2, 3, 4} under multiplication mod 5, when we obtain 

Table 6 


x 5 

0 

1 

2 

3 

4 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

3 

4 

2 

0 

2 

4 

1 

3 

3 

0 

3 

1 

4 

2 

4 

0 

4 

3 

2 

1 
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Here there is an identity element 1, but the element 0 has no inverse, so 
the structure is not a group. If, however, 0 is deleted from the set, we do 
have group structure, described by 


Table 7 


x 5 

1 

2 

3 

4 

1 

1 

2 

3 

4 

2 

2 

4 

1 

3 

3 

3 

1 

4 

2 

4 

4 

3 

2 

1 


Sometimes with finite arithmetic under multiplication it is necessary to 
delete more than merely 0 from the set. For example, working mod 6 on 
the set 1, 2, 3, 4, 5, we arrive at 


Table 8 


X 6 

1 

2 

3 

4 

5 

1 

1 

2 

3 

4 

5 

2 

2 

4 

0 

2 

4 

3 

3 

0 

3 

0 

3 

4 

4 

2 

0 

4 

2 

5 

5 

4 

3 

2 

1 


and we see that, with identity 1, the only elements which are inverses are 
1 and 5. So we must omit 2, 3 and 4 to obtain the set {1, 5} with only two 
elements forming a group under multiplication mod 6. Nevertheless, the 
set {2, 4} also forms a group under x 6 , only here the ‘identity' is 4: 


Table 9 

x 6 4 2 

4 4 2 
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Again, working mod 15, the set {1, 2, 4, 8} is closed under multiplication: 
Table io 


X 15 

1 

2 

4 

8 

1 

1 

2 

4 

8 

2 

2 

4 

8 

1 

4 

4 

8 

1 

2 

8 

8 

1 

2 

4 


But it is more surprising to discover the set {3, 6, 9, 12} which has 6 as its 
identity under multiplication mod 15: 

Table n 


X 15 

6 

3 

9 

12 

6 

6 

3 

9 

12 

3 

3 

9 

12 

6 

9 

9 

12 

6 

3 

12 

12 

6 

3 

9 


Note that in Table io, the elements 1, 2, 4, 8 can be written as 1,2, 2 2 , 2 3 , 
with 2 4 = 1 (mod 15), so there is a certain resemblance to the situation 
arising from R 4 = / in Table 3 . Similarly in Table 11 , the elements may 
be written 3, 3 2 , 3 3 (= 12), 3 4 = 81 = 6 (mod 15). 


Symmetry groups 

The finite groups illustrated above have been drawn mostly from groups of 
permutations and from finite arithmetics—groups of residues to some 
modulus under addition and multiplication. But there have also been cases 
of groups of matrices and groups of symmetries, and we shall now have a 
little more to say about the latter. 

Tt was clear on p. 494 that a plane figure possessing rotational symmetry 
gave rise to a group of order 4. Similarly, a figure with rotational symmetry 
of order 9, i.e., which is unchanged by rotations though multiples of 40°, 
as in the case of the ‘plate’ illustrated, will have a symmetry group whose 
elements are rotations through 0°, 40°, 80°, . . ., 280°, 320°. 

Calling r an anti-clockwise rotation through 40°, the elements of the group 
(C 9 ) will be 1 , r, r 2 , r 3 , r 4 , r 5 , r 6 , r 7 , r 8 ; where, for example, r 7 r 5 = r 3 , 
indices being added modulo 9, since r 9 = 1. It is clear that such rotations 
may be represented by permutations of the vertices; for example, rotation r 
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replaces the vertices 0, 1, 2, . . ., 8 by 8 , 1, 2, . . ., 7 respectively, and so cor¬ 
responds to the cycle (0 8765432 1). Moreover, the same set of 
rotations may be represented by matrices; r, for example, being represented 
by 

/cos 40° — sin 40°\ 

ysin 40° cos 40°/ 


So we have a set of nine matrices forming the group of order 9 under matrix 
multiplication. Again, in the Argand diagram, rotation through 40° is the 
geometrical equivalent of multiplication by the complex number cis 40°. 
Calling this complex number co, we have the nine complex numbers 
{1, co, co 2 , co 3 , to 4 , co 5 , co 6 , co 7 , co 8 } forming the group of order 9 under mul¬ 
tiplication, e.g., co 7 x co 5 = co 3 (remembering co 9 — 1 ). These are the 
nine complex ninth roots of unity (cf. p. 135 ). 

Groups also appear in the case of sets of functions under function com¬ 
position (see 481 ). For example, if 


/(*) 


v — 1 

X + 1 


then /</(«)) =/■(«) - - ')/(^rr + ’) 


x — 1 — x — 1 1 

X — 1 T X T 1 X* 


/ 3 M 


X 

“1 

-+ 1 

X 


1 + X 
1 - X 


and finally, f*(x) = x — i(x), where i is the identity function. 
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So these four functions: i,ff 2 ,f 3 combine according to 
Table 12 



i 

f 

f 

f 

i 

i 

f 

f 

f 

f 

f 

f 

f 

i 

f 

f 

f 

i 

f 

f 

f 

i 

i 

f 


Cyclic groups 

Apart from the group of Table 2 , all the finite groups mentioned so far 
have been cyclic groups. This means that all the elements of the groups may 
be expressed in terms of a single element , or generator (as it is called). Thus in the 
case of the functions in Table 12 above, the elements were i, fJ 2 J 3 , where i 
might be expressed as/ 0 ,/is of course/ 1 , and f 3 might have been written 
f~ 1 , the group being governed by the over-riding relation / 4 = i. Indeed, 
whenever (as in Table 3 ) we have a situation of the type x 4 = 1, there we 
must have the cyclic group of order 4, usually called C 4 ; and the same 
group would arise in the multiplication of the set { 1 , i, — 1 , —i}, where 

i 2 = -1. 


Exercise 14.3b 

1 Write out the table for multiplication of the set {1,3, 5, 9, 11, 13} under 
multiplication mod 14, and show that it is a group. 

2 Show that, if x is the permutation 

A 2 3 4 5\ 

V4 1 5 2 3/ 

and e is the identity permutation, then e , x, x 2 , x 3 , x 4 and x 5 , form a group. 

3 What is the order of the group generated by the permutation 

A 2 3 4 5 6 7 8 \ 

9 \2 7 4 8 5 1 6 3 ) 

4 Show that { 1 , 4, 7, 13}, x 15 is a group, and find a group under x 15 
which contains the number 3. 

5 Find a group under x 2 o> containing the number 8 . 

6 Show that the functions mx, 1 — x, 1 — 1/x, together with their 
reciprocals form a group under function composition. Verify that it is not 
commutative. 
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7 What is the order of the symmetry group taking both rotations and 
reflections in all cases, of: 

(i) a playing card (say Jack of Hearts); 

(ii ) an isosceles triangle; 

(Hi) a rhombus; 

(iv) a pyramid on a regular pentagonal base; 

(y) a regular hexagon; 

(vi) a square. 

8 Draw examples of plane figures whose symmetry groups have order 
3, 4, etc. 


9 Show that the permutations 
1 2 3 4 5 6 > 

4 6 5 1 3 2; 

when combined in every possible way give rise to a group of order six. 


(\ 2 

3 

4 

5 

6\ 

and U 5 

4 

3 

2 

> 


10 Take the matrices 



and 



and consider all possible matrices obtained by combining them by multi¬ 
plication (e.g., AB , BA , ABA, etc.). Show that these form a group of order 8 . 


11 Show that the complex 12 th roots of 1 form the group C 12 under 
multiplication. Is it possible to obtain the group C 12 under addition of 
complex numbers? 


12 Obtain a set of permutations forming the group C 7 . Also obtain a set 
of permutations of seven objects forming the group C lQ . 

13 What are the periods of each element in the groups C 9 , C 12 ? 


14 By writing each of the set {1, 2, 3,. . 12} as a power of 2 (mod 13), 

e.g., 12 = 2 6 (mod 13), show that this set forms the group C 12 . 


15 A rectangular cuboid has the following symmetries: 

(i) half-turns about three mutually perpendicular axes (call these a , b , c ); 

(ii ) reflections in three planes of symmetry. Call these p , q , r, where p is the 
reflection in the plane containing the axes of b and c , etc. 

If the identity is called e , write out the table showing the composition of 
e , a , b , c and show that this is a group. What is its structure? Write out the 
table showing the composition of the seven symmetries listed, showing that 
a group is obtained (the ‘full group’ of the cuboid), only when an eighth 
symmetry is included, and specify the nature of this eighth element. 

16 In the set of ordered pairs of reals under the operation * defined by 
the formula 

(*i,yi) * i.x 2 ,y 2 ) = (, x i x 2 ,x l y 2 + yj, 

what is the identity, and what is the inverse of (x, y) ? What must be done 
to arrive at group structure ? 
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14.4 Isomorphisms 

Saying that Table 12 is the ‘same 5 group as Table 10 and Table n is 
rather like saying that f is the 4 same ’ fraction as f and • It is more correct 
to describe these fractions which are equal to f as belonging to an ‘equiva¬ 
lence class ’ of fractions, of which it is convenient to take f as the simplest 
representative. The tables for the groups discussed are the ‘same’ in the 
sense that they have identical patterns, as one can readily see by the 
‘stripes’ running down from N.E. to S.W. The stripes are not apparent 
in the case of Table 3 till a rearrangement is made by interchanging R and S : 


Table 13 



I 

R 

H 

s 

- 1 

I 

I 

R 

H 

S 

R 

R 

H 

S 

I 

H 

H 

S 

I 

R 

S 

S 

I 

R 

H 


The correspondence with Table 14 (below) for {1, z, —1, — *}, x is now 
clear when the elements are paired off: 

I <-> 1 ; R i; //«-►— 1; S <-► —i. 

Table 14 


X 

1 

i 

-1 

— i 

1 

1 

i 

-1 

— i 

i 

i 

-1 

— i 

1 

~\ 

-1 

— i 

1 

i 

— i 

— i 

1 

i 

-1 


This conforms with our knowledge that multiplication by i is represented 
geometrically by a quarter turn (R). However, there would be nothing 
to stop us setting up the correspondence 

I <“► 1 , R — Z, //<“► — 1 , S <-> Z, 

with R and S interchanged. The reader should do the same in the case of 
the group whose Table 7 is shown on p. 497. 

Groups which have identical structures are said to be isomorphic , and the 
idea of isomorphism is one of the most important in mathematics. Indeed, 
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the fact that the same pattern may arise in a wide variety of situations is the 
main justification for a close study of structure as it means that techniques 
devised for one situation may well be applicable in a seemingly different 
context, because the two situations may, for all their apparent diversity, 
really be examples of the same structure. The relation between two groups 
of being isomorphic is usually denoted 4 = \ It will emerge later that this is 
a so-called equivalence relation between groups, and that isomorphic groups 
form an equivalence class , which may be called the abstract group of that 
structure. 


Groups of order 4 

We have already seen, however, that there are two quite distinct groups 
of order 4: 



1 

X 

y 

z 



1 

X 

y 

z 

1 

1 

X 

y 

z 


1 

1 

X 

y 

z 

X 

X 

y 

z 

1 

and 

X 

X 

1 

z 

y 

y 

y 

z 

1 

X 


y 

y 

z 

1 

X 

z 

z 

1 

X 

y 


z 

z 

y 

X 

1 


The first is the cyclic group C 4 , and we have had several examples ranging 
from the rotations of the figure on p. 494 to the set {1, 2, 3, 4} under multi¬ 
plication mod 5. Other examples of this group are to be found in Tables 1 , 3 , 
7 , 10 , 11 , 12 , 13 and 14 . 

The second, although so far provided with only two illustrations, (p. 480 
and Table 2 ), is no less important. This is known as the 4 Klein 4-Group \ and 
contains three elements of period 2. Other representations of this group 
which the reader may check are {1, 3, 5, 7} x mod 8 , and the combination 
of the four functions x, —x, \/x, —l/x. For representations of the Klein 
4-Group in a large number of contexts, see Budden, F. J., The Fascination of 
Groups , C.U.P., 1972 , pp. 131 - 8 . 

The above two are, in fact, the only possible distinct groups of order 4. 
As for groups of higher orders, it is easy to show that there is only one pos¬ 
sible group of order n whenever n is prime; whilst for composite values of n , 


n = 6 : 

2 groups 

(including one non-Abelian) 

n = 8 : 

5 groups 

(two non-Abelian) 

n = 9: 

2 groups 


n = 10 : 

2 groups 

(one non-Abelian) 

n = 12 : 

5 groups 

(three non-Abelian) 
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Groups of order 6 

Consider the three sets of permutations: 



(0 

1 2 3 4 5 6 

2 3 4 5 6 1 

3 4 5 6 1 2 

4 5 6 1 2 3 

5 6 1 2 3 4 

6 1 2 3 4 5 


1 1 2 3 4 5 

r 3 12 4 5 

r 2 2 3 1 4 5 

d 1 2 3 5 4 

y 3 12 5 4 

z 2 3 15 4 



3 

2 

1 

2 

1 

3 


It may easily be verified that each of these sets is closed, and that in each 
case we get a group of order 6 : 


Table 15 



(iii) 


1 

1 

P 

<N 

a 

b 

c 

1 

1 

P 

P 2 

a 

b 

c 

p 

P 

P 2 

1 

b 

c 

a 

P 2 

P 2 

1 

P 

c 

a 

,b 

a 

a 

c 

b 

1 

P 2 

P 

b 

b 

a 

c 

P 

1 

CM 

c 

c 

b 

a 

P 2 

P 

1 


(It should be noted that the permutation which is performed first is shown 
across the top of the table, the second permutation being shown down the 
side, so that in Table (iii), for example, ac = p signifies that permutation c 
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is performed first, so that the a is read off on the left and the c at the top). We 
shall now show that the first two are isomorphic, but that their structure is 
different from that of the third group. 

First note that x may be denoted in cycle notation as (1 2 3 4 5 6 ), 
and clearly * 6 = 1, whereas x n ^ 1 for n = 1, 2, 3, 4, 5. We say that the 
permutation x has period 6 or order 6 , meaning that 6 repetitions of the per¬ 
mutation are needed in order to get back to the identity. The 6 permuta¬ 
tions of the group are thus {1, x, x 2 , * 3 , x 4 , x 5 }, and the group is C 6 . 

Looking at the second set of permutations, we see that r and r 2 are each 
of period 3, that d is of period 2, but when we come to y (= dr = rd), we 
have a permutation of period 6. This encourages us to begin to set up a 
correspondence between the first two groups by matching x with y. Indeed, 
it turns out that y 2 = r 2 , y 3 = d , y 4 = r,y 5 = 2 , so that we have the cyclic 
group { 1 , y, y 2 , y 3 , y 4 , y 5 } governed by y 6 = 1 . The tables for the first two 
groups would thus correspond exactly; for wherever * occurs in the first, 
we should have y in the second. This same group would arise in other 
situations, in fact whenever an equation of the type x 6 = 1 appears, e.g., 
if co = cis 60°, co 6 = 1; or if R were a rotation through 60°, we should 

have R 6 = I (the identity); if M = ^ then M 6 = /; again, 

since 3 6 = 1 (mod 14), we have the group C 6 appearing as a multiplicative 
group in a finite arithmetic. 

The group of the third set of permutations is different from the first two 
in the sense that, while we can easily enough set up a 1,1 correspondence, 
we cannot do so in such a way as to preserve products. That this must be so is 
immediately apparent from the fact that the third group contains no 
permutation of period 6 , the periods of permutations being respectively 
1, 3, 3, 2, 2, 2. Another easy way of seeing that the third group is different 
is because, unlike the first two, it is not Abelian; for b — (1 3), c = (1 2), 
then remembering that be means ‘ first c, then b ’, we have be = (1 2 3) = p 2 , 
whereas cb — (1 3 2) = />. This non-Abelian group of order 6 will be dis¬ 
cussed in more detail on p. 509 when we consider the symmetries of the 
equilateral triangle. 

Isomorphism of infinite groups 

Now it is all very well saying that two groups are isomorphic if their tables 
may be rearranged to correspond, but this is only meaningful in the case of 
finite groups. How may one describe isomorphism in the case of infinite 
groups? To answer this, we must give a more precise definition for two 
groups G, G' with different operations 0 and *: the groups G, 0 and G', * 
are said to be isomorphic provided there is a one-to-one correspondence 
between the elements 
x, y, . . . G G and y', . . . G G' 

such that for all x, y G G and y' e G', it is also true that x o y corresponds to 

* y- 
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Now take the group R + , x of positive reals under multiplication and 
the group R , + of all reals under addition. 

These can easily be shown to be isomorphic by the simple process of 
mapping the first set onto the second by the logarithm function x \-+ In x. 
Then if z = xy in the first group, we have In z = In x + In y, so that 
z' = x' + y’ in the second group and the structural properties of the first 
group are reproduced in the second (where the operation is, of course, 
addition). 

Again, consider the groups Z, + ; 10Z, +; and also the geometrical 
group generated by a translation t : 


t 2 



We may show that these three groups are in fact isomorphic by setting 
up a correspondence where x in Z, + corresponds with 1 Ox in 1OZ, + and 
to f in the group of translations. Then ‘products’ (as we shall continue to 
call them, though the operation is addition) are preserved. 

For example, — 5 + 2 = — 3 in the group Z, -b 

becomes — 50 T 20 = — 30 in the second group, 

and t~ 5 x t 2 = t~ 3 in the case of the third group. 

Or, more generally, x + y = z in Z, + 

corresponds to lOx + lOy = lOz in 10Z, +, 

and to ft y — f in the third group. 

Consider next the operation on the integers given by the formula 

m * n = m + (— \) m n. 

This is clearly non-commutative, but is in fact associative. Closure is self- 
evident, while 0 is a left and right identity. Finally, the inverse of an even 
integer is its negative , e.g., 

18 * (-18) = 18 + (— 1) 18 (— 18) = 0, 

(-18) * 18 = -18 + (— 1) — 18 18 = 0, 
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but the inverse of an odd integer is itself, e.g., 

19 * 19 = 19 + ( — 1) 19 19 = 0 
(-19) * (-19) = -19 + ( —1) -19 (— 19) = 0. 

Thus Z, * is a group. 

For a second group we take the series of images set up by reflection in 
two parallel mirrors. 




In the figure, a represents the operation of reflecting in the first mirror, 
b represents a reflection in the second mirror, and following the usual 
convention, baba (for example) represents the result of two pairs of alternate 
reflections in the two mirrors, starting with the first. In the figure, each 
image of the bird motif is labelled according to the reflections leading to it. 
These two reflections generate an infinite group, the elements of which are 
every possible ‘word ’ formed from the two letters a and b , e.g., abababababa , 
bababa , etc. (since a 2 = b 2 = 1 , there will be no powers of a and b higher 
than 1 ). 

We now show that these two infinite non-Abelian groups are isomor¬ 
phic. The method of setting up the correspondence between Z, * and 
the group generated by reflections a and b is that shown in the figure 
whereby each image is labelled not only with the ‘word 5 describing the 
reflections, but is also allotted an integer in the bottom row. The way the 
correspondence works out is that the number in the bottom row, regardless 
of sign, says how many letters there are in the ‘word’, e.g., —3 for aba ; 
+ 6 for bababa ; while the sign is + if the initial letter of the word is b , — if 
it is a . The way in which the products are ‘imitated’ between the two 
groups is illustrated in two cases: 

Z, * Reflections 

5 * 2 = 3 ( babab) ( ba) = babab 2 a = bab 

— 5 * 3 = — 8 ( ababa ) (bab) = abababab 

6 * — 3 = 3 (bababa) (aba) = bababa 2 ba — bab (usings 2 = b 2 = 1). 

Thus the two groups are isomorphic, and in giving this example we have 
incidentally illustrated how groups may arise in the symmetries of an 
infinite geometrical pattern, here a one-dimensional ‘ frieze ’ pattern formed 
by the bird motifs. 
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Exercise 14.4a 


1 Show that {l,z, — 1, — i}, x is a group isomorphic to the group 
{1,3,7, 9}, x mod 20, and to the group of rotations of the figure on p. 494 , 
and also to the permutations {i, x, x 2 , x 3 } where 


A 2 3 4 5 6 \ 
V2 1 4 5 6 3/ 


2 Invent other examples of the Klein four group other than those in the 
text: 

(z) as a finite arithmetic under multiplication; 

(ii) as a set of permutations (of four objects); 

(Hi) as a set of four 2x2 matrices; 

(iv) as a symmetry group of a plane figure other than the rectangle. 


3 Why is it not possible to set up an isomorphism between 

(i) the symmetries of the equilateral triangle and the rotations of a 
regular hexagon; 

(ii) the symmetries of the ellipse and {1, 3, 7, 9}, x 10 

(iii) the complex numbers under addition and the vectors in three- 
dimensional space under addition; 

(iv) the positive reals under multiplication, and the rationals under addi¬ 
tion. 


4 If a group is isomorphic to a proper subgroup of itself, what must 
necessarily be true? 

5 Show that {2", ra e Z}, x is isomorphic to the set of translations in a 
line generated by a single given translation. 

6 Let S be the set {*: 0 < x < 1 , x e R}. Show that S with the operation 
addition mod 1 (e.g., 0.7 + 0.84 = 0.54), is a group, and that this is iso¬ 
morphic to the group of rotations of the circle. 

7 Establish the isomorphism between the groups shown in these two 
tables: 



a 

b 

c 

d 

f 

g 

a 

a 

b 

c 

d 

f 

g 

b 

b 

c 

a 

f 

g 

d 

c 

c 

a 

b 

g 

d 

f 

d 

d 

f 

g 

a 

b 

c 

f 

f 

g 

d 

b 

c 

a 

g 

g 

d 

f 

c 

a 

b 



p 

q 

r 

s 

t 

u 

P 

p 

q 

r 

s 

t 

u 

q 

? 

P 

u 

t 

s 

r 

r 

r 

u 

s 

q 

P 

t 

s 

s 

t 

q 

u 

r 

P 

t 

t 

s 

P 

r 

u 

q 

u 

u 

r 

t 

P 

q 

s 


8 In the group C 9 {0, 1, 2, . . 8 } + 9 , which elements are generators? 

Which are generators in similar representations of C Q , C 10 , C 12 ? If every 
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element of the cyclic group C n is a generator (except the identity), what can 
you say? 

9 Consider the group {\,p,p 2 ,p 3 } with p 4 = 1. Show that the permutation 
of the elements which interchanges p and p 3 preserves the structure, but no 
other permutation of the four elements has this property. (Such a permuta¬ 
tion is known as an ‘automorphism’, in this case of C 4 ). 

10 Find the automorphisms of the group {1, a, b, c] with a 2 = b 2 = 1, 
ab = ba = c. 

11 Show that the correspondence x «-* — x is an automorphism of Z, +. 

12 Show that {1, a, a 2 , a 3 , a 4 , a 5 , a 6 }, x where a 7 = 1 has six automor¬ 
phisms, one of which is a—► a 2 , and find the others. Find the automorphisms 
of the group C 8 formed by the eighth complex roots of 1. 

The group of equilateral triangle , D 3 

So far the only non-Abelian finite group has been the one of order 6 con¬ 
sisting of every possible permutation of 3 digits. This is the smallest case of a 
non-Abelian group, and we now look at it in more detail. In setting up the 
table showing the products, we must (since many of the products are non- 
commutative) specify that when we read the product xy from row x and 
column y , this is to be the value of xy ; so y, the first operation, is shown across 
the top of the table, and the second operation is down the left-hand side. 
The table showing the composition of the permutations will be found to be 

Table 16 

1 st operation 




1 

p 

q \ a 

l 

b 

c 



1 

1 

p 

i 

q 1 a 

1 

b 

c 


c 

P 

p 

q 

1 1 b 

c 

a 

(13 2) 

_o 




1 




cd 

q 

q 

1 

P c 

a 

b 

(12 3) 

<L> 

Oh 

-- 

— 

— 

_l__ 

1 

— 

— 


O 

'V 

a 

a 

c 

b 1 1 

q 

P 

(2 3) 

G 




t 




CM 

b 

b 

a 

r | p 

1 

q 

(13) 


c 

c 

b 

1 

a | q 

P 

1 

(2 1) 


To complete the table, it is not necessary to find every single product from 
the permutations as we did on p. 505. For, once having obtained be = q , 
we may immediately deduce (for example): 

bq = bbc = b 2 c — r, since b 2 = 1 

qc = bcc — be 2 = b , since c 2 = 1 

cp — bqp — b\ = b, since qp = 1 , and so on. 
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We next show that the same group arises as a group of symmetries. 
Consider an equilateral triangle 1 2 3 which is subjected to the following 
rotations: 

P = (132). 



It will be seen that/? is an anti-clockwise or a clockwise rotation according 
to whether the triangle has or has not been turned over (is showing 4 heads ’ 
or ‘tails’); q is simply the inverse of this, so q = p~ 1 = p 2 ; whilst a is a 
rotation about the median through 1, so interchanges the vertices 2 and 3. 
Similarly, b = (1 3) and c = (1 2). These five rotations together with the 
identity constitute the symmetry group of the equilateral triangle. It is 
clear, not only that every rotation has been matched with a corresponding 
permutation of the vertices, but also that the rotations will combine in a 
way which exactly mimics the composition of the permutations. So we have 
an isomorphism. For example, corresponding to the equation pc — a 
expressing the composition of the permutations (1 3 2) (1 2) — (2 3), 
we observe that if the triangle is subjected first to a rotation about the 
median through 3 (thereby turning it over) and then to a (now clockwise) 
120° turn/?, we obtain the final position which would arise from a single 
rotation about the median through 1, i.e., a = (2 3): 



Note that 

(i) a^b^c were described as rotations, taking the triangle into three dimen¬ 
sions, but could equally well have been regarded as reflections ; and 

(ii) the triangle could just as well have been a uniform triangular prism , 
the rotations a , b , c being about symmetry axes situated midway between 
the triangular faces. 

In the previous paragraph we have been considering the group of sym¬ 
metries of the equilateral triangle which includes not only the plane 
rotational symmetries through 0, 120°, 240° (1, /?, q) which would be 


possessed by a figure like 



but also the bilateral symmetries 


about its three medians. Similarly, if we look at a square instead of an 
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equilateral triangle, it would possess not only the rotational symmetries, 
as in Table 3, but also the bilateral symmetries about four axes, 



and these will together give a group of order 8, the symmetry group of the 
square. The full group of rotations and reflections of a regular w-sided 
polygon is called the dihedral group of order 2 n, and is denoted D„. A useful 
exercise for the reader would be to construct the group table for D 4 , using 
the notation of this figure where (for a change) we use a, b, c, d to be fixed 
symmetry axes (unlike the axes which moved with the triangle in p. 510) 
while r is an anti-clockwise quarter-turn. The table should be set out with 
the elements in the order 1, r, r 2 , r 3 , a , b , c, d. 


Infinite symmetry groups 

Usually the symmetry group of a geometrical figure is finite, but we have 
already met (p. 507) one case of an infinite group, when the geometrical 
figure itself extended to infinity. The groups arising from repeating patterns 
are of great interest, but there is not spadfe to deal with them here, and the 
reader is referred to Budden, F. J., The Fascination of Groups , C.U.P., 1972, 
Chapter 26. Another case when we obtain an infinite symmetry group is 
that of the circle, which is not only symmetrical by rotation through any 
angle, but also possesses bilateral symmetry about any diameter. 


Symmetry groups of solids: the tetrahedral group 

We have already mentioned the symmetry groups of the cuboid (Table 2), 
and more recently the six rotational symmetries of the equilateral tri¬ 
angular prism. When dealing with the symmetries of three-dimensional 
geometrical figures, one has to specify whether one is concerned with 
rotations only (as above), or whether one is going to include reflective, or 
‘opposite 5 symmetries, in which case the cuboid would have a group of 
order 8, and the triangular prism one of order 12 . A good illustration of this 
is provided by the regular tetrahedron, which has eight 120° turn sym¬ 
metries about its four altitudes, but also three half-turns about its joins of 
mid-points of opposite edges which, with the identity, make a group of 
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order 12. If the vertices were labelled 1, 2, 3, 4, the 120° turn would corres¬ 
pond to such permutations as (1 2 3), while the half-turns correspond to 
(1 2) (3 4), etc. When reflections are included we should get all the other 
12 permutations of the four vertices, such as (1 2), (1 2 3 4) etc., which 
cannot be obtained as a result of direct geometrical movements, but only 
with the aid of mirrors! The group of all 24 (= 4!) permutations of 4 digits 
is called the symmetric group , S 4 . More generally, the group of permutations 
of n symbols is the symmetric group S n , and is of order n\. On p. 509 , we 
saw that the group of the equilateral triangle was in fact isomorphic to S 3 , 
so we may write D 3 = S 3 . As an exercise, the reader should construct the 
group of order 12 of the rotations of the regular tetrahedron. This should 
be done using the permutations p = (1 2 3) and a — (1 2) (3 4) 
described above, and it will be found that the whole group is generated by 
using these two permutations only, i.e., by forming all possible ‘words’ 
from a and p. The group is known as the tetrahedral group, or the alternating 
group of order 12 . 


Exercise 14.4b 

1 What are the symmetry groups of the octagons: 



2 Draw hexagons whose symmetry groups are C 3 , C 2 , D 2 , D 3 . 

3 Show that two mirrors at an angle of 90° will produce a pattern with the 
symmetry group D 2 ; and that if the angle were 60°, the group would be 
D 3 . Generalise. 


4 Show that the matrices 



and 



generate a group of order 8 , and that this group is isomorphic with the 
group of symmetries of the square. 


5 If/(x) = 2 — x and g(x) = 2/x, find gf (x) and fg(x), and the other 
functions generated by f and g. What group do they form? 


6 Note that the group of the equilateral triangle has the following rela¬ 
tions connecting the elements: 

ap = p 2 a ; ap 2 = pa\ pap = a; 


( 1 ab ) 3 4 = 1 ; aba = bab 
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and interpret these in geometric terms. Find other relations connecting: 
(i) b and q\ (ii) b and c. 

Show that the relations a 2 = p 3 = 1, ap = p 2 a are sufficient to deter¬ 
mine the group completely, and find another set of ‘ defining relations 5 
based on the ‘generators’ a and b. 

7 For a regular hexagon, let r denote a 60° rotation about its centre, and 
a, b, c, g be reflections in the six lines of symmetry. Show that, a = rar , 
and obtain b , c , g each in terms of the generators a and r, it being sup¬ 
posed that the six axes are labelled consecutively in the direction of the 
rotation r. Draw up the group table of the twelve symmetries of the regular 
hexagon (Z) 6 ). 

8 Discuss various tetrahedra with rotation groups smaller than the group 
of the regular tetrahedron (e.g., a tetrahedron with one face equilateral 
and the other three congruent isosceles triangles has the group C 3 ). 

9 Consider the rotations of the cube and classify them according to their 
period (e.g., a rotation through 90° has period 4). 

10 Show that the regular octahedron has symmetry group isomorphic 
to that of the cube. How many of the symmetries are rotations, and how 
many are opposite? 

11 Investigate (infinite) symmetry groups generated by: 

(i ) a single glide reflection; 

(ii) reflections in two parallel mirrors; 

(Hi) reflections in two mirrors at various angles; 

(iv) a half-turn and a translation; 

(v) a half-turn and a mirror reflection. 

Invent other examples. 



Epilogue 


How many numbers are there? 

We have come a long way since first learning to count and now retrace our 
steps to this apparently childish question. By now, however, we are aware 
of a variety of sets of numbers. In arithmetic modulo 2, for instance, there 
are just two elements, 0 and 1, whilst in the system of complex numbers 
there is a far richer variety including such diverse numbers as 

0, 1, -3, f i, yj2, V 3 - i, etc. 

In this epilogue we shall confine ourselves to the real numbers R and 
shall start by looking again at their subset of positive integers, Z + . 

The positive integers, Z + 

-•»••••- 

0 1 2 3 4 5 

A child very soon realises that there is no end to the sequence of natural 
numbers, or positive integers. Even so, they are capable of being counted , 
or set in order, because this is precisely what is done when we say ‘ one, two, 
three, four, five, . . . : we name them in sequence, so that — given life 
and breath — any given number would ultimately be reached. We there¬ 
fore say that the positive integers Z + are countable. 


The integers, Z 


-3 -2 -1 0 1 2 3 4 5 

When, however, we consider all the integers (positive, negative and zero) 






EPILOGUE 515 

an immediate difference is apparent, that there is no obvious starting-point 
for such an operation of counting. Even so, they can clearly be set down in 
the order 

0, +1, -1, + 2, -2, +3, -3, .... 

Here again, every integer — positive, zero or negative, and however 
large — has its place in this sequence, which therefore provides us with a 
means of threading our way through them. So, just as for natural numbers, 
the set Z of all integers is countable. 


The rational numbers, Q 

There are, of course, many gaps between the integers and into these gaps 
we can place other quotients of integers like f, — so obtaining the set Q 
of rational numbers: 

“6 2 T 

m » » ‘i m '• 1 • 9 -»- 

— 4 —3 —2 —1 0 1 2 3 

It is clear that there is no end to the number of such rationals that can be 
placed between two integers. The interval between 0 and 1, for instance, 
could be divided into tenths, hundredths, thousandths or millionths, and 
each of the points of subdivision would correspond to a rational number. 
So the number of rationals between 0 and 1 is clearly infinite. 

This is, of course, just as true of the number of rationals that can be 
placed between any two given rationals, however close they are. For if 
plq and r/s are the given rational numbers (p, q , r, s being integers), their 
mean is 

Up , r \ = ps_+jr a 

2 v? s ) %qs 

Now ps + qr and 2 qs are integers, so this mean is itself a rational number 
and we see that between any two rational numbers there is always another, 
and, therefore, by continued bisection, an infinity. 

There is, therefore, certainly no shortage of rational numbers and the 
former question again arises. Is it still possible to count our way through 
this multitude of rationals, or are they too numerous? Can we construct a 
sequence in which every rational number has its place? 

This question was first raised in 1873 by the German mathematician 
George Cantor and did not prove difficult for him to answer. Rather, as we 
shall see, did his genius lie in asking the question. 
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Let us start by considering the positive integers, together with the 

number 0. First we construct the pattern 

0 

1 

I 

1 2 

2 1 

1 y 3 

3 fi \ 

12 3 4 

4 3 2 I 

i y y y 5 

5 A ft A 1 


and delete any number which is not in its lowest terms. Now if this pattern 
is continued indefinitely, every positive rational number will have its 
place. We notice, for instance, that the fifth row consists of all positive 
rationals whose numerator and denominator add up to five and the sixth 
row consists of those whose numerator arid denominator (when expressed 
in their lowest terms) add up to six. Similarly the rational number -jj 
would occur in the 20th row; and, more generally, the number p/q (if 
expressed in its lowest terms) in the (p +■ ^)th row. 

So, although the positive (and zero) rationals are extremely numerous, 
we have invented a way of counting them: 



I 
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Now we can just as easily count all the rationals (positive, zero and negative) 
simply by placing each negative rational immediately after the correspond¬ 
ing positive rational: 

0; +1, —1; +“2 9 “25 +2, —2; H-i, *••• 

Hence the set Q of all rationals is countable. 


The continuum of real numbers, R 


- 3 - 2-10 1 2 3 

There is such a multitude of rational numbers, with an infinity of them 
inside any interval, that the question naturally arises whether there is 
room for any other kind of number. Now we have seen that Euclid had 
effectively answered this question about 300 bc when showed that 2 is 
not a rational number and we naturally ask whether such numbers are 
rare or common. Are most numbers rational or irrational? Certainly we 
have seen that the rational numbers are countable. Can we similarly 
count all the numbers on the straight line, rational and irrational? 

This question too was first asked by Cantor and at the beginning of Decem¬ 
ber in 1873 —just a hundred years ago as this epilogue is being written 
he answered it by proving one of the most famous results of mathematics, 
that of the non-enumerability of the continuum: the real numbers R , rational and 
irrational , are too numerous to count. 

Let us consider the numbers between 0 and 1 and suppose that they can 
be arranged in a sequence 

a l 9 a 2 5 a 3 ? a 4> • • • 5 

which includes every such number. 

Now each of these numbers can be expressed as a decimal. It is true that 
there is a slight difficulty about decimals which terminate as they can be 
represented either with a recurring 0 or a recurring 9: , for instance, can 

be written 

either as 0.150 000 . . . 
or as 0.149 999.... 

But in such cases we simply stipulate that a recurring 9 is never to be used. 
So each number can be represented uniquely in decimal form, and we can 
write 

—■ 0 • CC ^ 1 0 C 1 2 ^ i 3 OC 14 . • • 

#2 — 0 . OC 2 1^2 2 ^ 23^24 * * * 
a 3 — 0 • 0 C 31 OC 32 & 33 OC 34 • . • 
a 4 = - 0.a 41 a42a43a 4 4... 

as a sequence containing every number between 0 and 1 . 
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Having done this, we now attempt to construct another number 

6 = 0 .jW 3 /V.. 

which is different from every a. Deciding to be as perverse as we possibly 
can, we see that this could be achieved if b were 

different from a x in its first decimal place, 
different from a 2 in its second decimal place, 
different from a 3 in its third decimal place, and so on. 

We therefore choose /? l5 j 8 2 , j 8 3 . . . by the following rule: 

if a nn # 5, choose = 5, 
and if a nn — 5, choose /?„ = 4. 

Then ^ a nn , for all values of n. 

Hence b = 0,p i ^ 2 p 3 . . . 

is different from every single member of the sequence , a 2 , a 3 . . . . By 
this means, therefore, we have succeeded in constructing a number between 
0 and 1 which is not a member of this sequence. Hence our initial assumption 
is contradicted, and we see that it is not possible to count even the real 
numbers between 0 and 1. It is, therefore, certainly impossible to count all 
the real numbers R , so that the continuum is not enumerable . 

This, of course, provides us with another proof, over two thousand years 
later than that of Euclid, that there are other numbers besides the rationals. 
For the rational numbers were countable, but the continuum is certainly 
not countable, so must be far more numerous; and whereas Euclid’s proof 
produces a single irrational number, Cantor produced an uncountable 
infinity all at once. 


Transcendental numbers 

It is now interesting to speculate about the numbers which constitute this 
uncountable multitude. Which are they? A first guess would be that they 
are irrational numbers like yj 2, %/5 and But this is mistaken. For the 
positive rational numbers Q + , as they are countable, can be written 


r i, r 2 , r 3 . . . 
so their square roots are y/r t , ^Jr 2 , s Jr 3 . . . 
their cube roots are v^i> \J r 2 > \J r< $ 
their fourth roots are ^jr l , ^/r 2 , ^ jr 3 . . . 


etc. 
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Now we can count through this set of numbers by this route: 



% 


and so the set of all positive rational numbers and all their roots is countable. 

It can similarly be shown that the set of all polynomial equations with 
rational coefficients have roots (known as the set of algebraic numbers) 
which are also countable. So it is clear that there must exist a vast un¬ 
countable multitude of numbers which are not the roots of any such 
equation. These numbers, which are not algebraic, are called transcendental. 

Which numbers are transcendental? Certainly it is clear that no rational 
number or root of a rational number is transcendental, and we might well 
wonder what comprises this vast galaxy. Now by coincidence it was also 
proved in 1873 that e cannot be the root of any algebraic equation with 
rational coefficients, so must be just such a transcendental number; and in 
1882 it was shown that n is another. Even today, however, we know very 
few kinds of naturally occurring transcendental numbers: In 2 , e n , 2 ^ 2 , sin 1 
have been proved transcendental, but they are rarities. 

What is known is simply that when all rationals, together with all roots 
of all rationals, and together with all the roots of all algebraic equations 
with rational coefficients, have been removed from the continuum, it has 
hardly been diminished, and that the infinitely vaster multitude of trans¬ 
cendental numbers has remained secretly in hiding. And in this, it might 
be said, these numbers are typical of the whole of mathematics, where 
much is known but much more is not. 



Answers to exercises 


Exercise 9.1a 

1 (: i ) 0.69 (ii) 1.10 (iii) 1.79 (iv) 2.20 

(v) 2.48 (vi) -0.69 (vii) -1.10 


Exercise 9*ibf 


1 


2 

3 

4 


5 

6 


(0 1.386 30 (ii) 1.791 76 (iii) 2.302 59 (iv) 2.484 91 
(») -0.693 15 (vi) -1.791 76 (i/ii) -2.48491 
(viii) 13.815 54 (ix) 0.346 57(5) (x) 1.535 06 

(i) 3 (ii) 0 (iii) —1 (iv) 0.5 

1 


(0 * 2 

1 


(ii) £ 100 

3 


(m) ^ (w) 2 


(0 - (**) - («i) - (iv) 


(viii) 
(xii ) 


x(x + 1) 
1 — In x 


(ix) 


- 1 


M - 

« 


1 . .. 1 

* 2x 

3x + 2 


(w) — (m) 


2 * 


* 2 + 1 


2x(x + 1) 


(xi) 1 + In x 


(z) cot x (ii) — tan* (iii) tan r (iz>) sec * ( 1 ;) cot r 

2 In ^ 1 

(vi) 3 cot 3# (vii) 2 cot * (viii) 2 cot a: (ix) - (*) - 

x x In x 

(i) 3 In x (ii) In (x m + 2 ) (iii) \ In (2x + 3 ) (iv) x 4 - In x 
(v) -In (1 — x) (vi) In (x 2 + 1 ) 

(vii) — f In (1 — x 3 ) (viii) j In (x 2 — 2x + 3) (ix) ^ In (x 3 + 3x) 


t In this and subsequent exercises, the arbitrary constant has been omitted from all indefinite 
integrals. 
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(*) In sin x (xi) | In sec 3* (xii) 2 In sin - 

(xiii) In (sin* — cos*) (xiv) In (1 + tan*) 

7 (i) 0.916 3 (ii) 0.336 5 (iii) 0.125 6 (iv) -1.098 6 

(v) Impossible (vi) 0.49 (vii) — 0.804 7 ( viii ) Impossible 
(?'*) 0.693 1 (*) 0.346 5 

8 (i) 0.693 65 (ii) 0.693 10 

9 (i) n In 2 « 2.18 (ti) Jt In 4 « 4.35 

10 2.21 

1 

11 (i) 1.295, 12.71 (ii) - 


Exercise 9.2 

1 2.208 x 10 4 , 2.692 x 10 43 , 1.105,0.452 9 x 10 " 5 

2 (i) 2.303, 6.908, 13.82 (ii) 4.605, 9.210, 13.82 

3 (i) 3 « 3 * (ii) (in) (1 - x) e~ x {iv) (* 2 + 2 *) e* 

X — € X 

(v) -xe ~ ix2 {vi) -- (t/ii) tf~ x (sin* — cos*) 

(via ) e 2x (2 cos 3 * - 3 sin 3*) (ix) 3* In 3 (*) (In * + 1)** 

4 (i) -\e~ 2x (ii) 3 e* x (iii) \t ? 2 (iv) e smx (v) -e ~ ix2 


( A 1 99 

5 (z) 2 v 1 - 7 ) ( * 4 - ( ” V) 1 _ 7; m liiio 

1 4 n" 

6 (i) - (n) (m) — (iv) 1 

e e e 

7 1 

8 (i) nn (ii) nn + \n 

9 (iii) 10.52% (iv) 6.93 days 

10 (iii) Y 600 In 2 « 4.33 x 10” 4 (iv) 4.3% 

11 (i) 565 m (ii) 13.8 min (iii) Never 


Exercise 9.4a 

2 (i) 1 — x + ^x 2 — ^x 3 + J 4 X 4 (ii) 1 + 2 x + 2 x 2 + fx 3 4 - fx 4 
(iii) 1 + \x 2 + J 4 X 4 (iv) x + ^x 3 

3 (i) 1.648 72 (ii) 0.367 88 (iii) 1.105 17 


Exercise 9.4b 

2 (i) ^x — \x 2 + J 4 X 3 — -^x 4 (ii) 2x — 2x 2 + fx 3 — 4 * 4 
(iii) x 2 -f \x 4 

3 (i) 1.099 (ii) 1.511 (iii) 2.610 
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Exercise 9.4c 


1 


2 

3 

4 


(0 1 + yX — ^X 2 + YqX 3 \x\ < 1 

(ii) 1 — x + x 2 — x 3 \x\ < 1 

(iii) 1 — 3x + 6 x 2 — 10x 3 \x\ < 1 

(; iv ) 1 + x + fx 2 + 2^ 3 H < 2 

(y) 2 — + 8 j2 — Ve * 3 w < 2 

(vi) 2 - jx - rex 2 - gV* 3 |at| < 2 

(t) 1.004 99 (ii) 0.997 49 (iii) 2.024 84 
(v) 0.247 512 (vi) 0.126 89 
4 

,.. N 1 2 3 2 4 

i 11 ) ~2 + ~3 + 4 ’ X + o v 2 q„5 


(iv) 3.009 97 


Exercise 9.4c! 

1 (z) X — |x 3 («) 1 — \x 2 4 - J 4 X 4 (Hi) x + ^x 3 (it/) 1 + jx 2 -\~^x 4 

(t/) x + x 2 + -J-x 3 (vi) \x 2 + y^-x 4 

2 0.100 33, 0.017 45, 0.000 29 

3 1.035 53,1.000 58 


Exercise 9.5a 

1 (i) x e x — e x (ii) sin x — x cos x (Hi) \( 2 x — 1 ) e 2x 

X X 1 

(iv) \x 2 In x — \x 2 (v) 2x sin - + 4 cos - (vi) —- (In x + 1) 

2 (t) — f x 2 + 2 x + 2 ) e~ x (ii) 2 x sin x + (2 — x 2 ) cos x 

a 10“ 10" 

(ill) e tan 6 - log sec 6 (iv) j—^ 

3 (i) j e x (sin x — cos x) (ii) \ e x (sin x + cos x) 

(Hi) | e~ x (sin % - cos x) (iv) j e x (sin 2 x - 2 cos 2 *) 

4 (i) *{(ln x ) 2 - 2 In x + 2 } (ii) x tan -1 x - \ In (1 + x 2 ) 

(iii) u sin~ 1 u + J(l - u 2 ) (iv) %(x 2 + 1 ) tan -1 x -\x 

5 (i) 1 — — (ii) in (iii) 2 n 2 — 16 (iv) fin 2 — | 

e 

6 £25 


Exercise 9.5b 


1 


2 

3 


<*’> I <*> I <*'> I 

16 2n 

(*) y y (“0 0 



(v) 


35 71 
256 


W ig (®) 0 


x (In x ) 3 + 3 x (In x ) 2 — 6.x In x + 6x 

/„ = —!—tan"" 1 0-/„_ 2 
n — 1 


3- tan 3 0 — tan 0 + 0 
j tan 5 0 - y tan 3 0 + tan 0 - 0 


4 
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Exercise 9.6a 

-2 3 

1 (0 -r + 


x — 1 


(*») 


x - 2 
4 


(») 


11 12 

— 2 -I-— (Hi) --- 

x - 2 x + 2 v x 2 x 4- 1 


| 1 2 

a:— 1 x — 2 x — 3 ^ x x 4- 1 


(«) 1 - - + 

x x — 2 


1 i 

(viii) x -i - —- -f 2 


(vii) 1 4* 


1 


1 


* 4- 2 


x — 1 x 4- 1 

1 


2 (0 -- l —r + , n 

* * - 1 — 1) 


2_1_2 

2 ( M ) 2 


(m) 


3 (0 - ~ 
(in) 


x — 1 (x — 1) 

1 * («) 


X 4* 1 

4 1 

x+7 (W) 1 “ 4(* + 1) " 4(x - l) 2 


4 


5x — 3 


X X 2 + 1 

1 

3 


X — 1 X + 1 X 2 + 1 


i(* + 2) 


x — 1 x 2 + x + 1 
4 1 — 3* 4- lx 2 — 15x 3 

1 n 


(iv) 


X + 1 


*(* - 2) 

c 2 — X + 1 


5 (0 

(«) 

(iii) 


r(r 4 1) n 4* 1 
1 « 4 1 


1 

1 

-» — 


r(r 4- 2) 2(n + 2) 2 

1 1 


1 


__1 

r(r + 1) (r 4- 2) 4 2 (n + l)(w + 2) 4 


Exercise 9.6b 


x - 1 


1 (0 2 to——T (**) to—— (ill) iIn --- 

x + 1 x + 1 2 — x 

... _ x — 2 . . . , 2x - 1 * 2x — 1 

(iv) X + In—— (v) 4 In-—— (w) 3 In• 


x + 2 


2 (i) - 


2 x + 1 

1 1*4-2 


— (ii) -1— In 

x + 1 V ' 2x 4 x 


x 4- 1 

2 x 

(Hi) -r + In 


x — 1 


(iv) x 


x — 1 


4 - 2 In (x — 1) 


(i) In (x 2 + 9); | tan 1 \ In (x 2 4- 9) 4- tan 1 - 

3 3 

2 x 2x 

(ii) In (4x 2 + 9); £ tan~ 1 —; £ In (4x 2 4* 9) — \ tan“ 1 — 


3 
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(*«) In (x 2 - 6 x + 10); tan -1 (x - 3); 

2 In (a 2 — 6* + 10 ) + 3 tan” 1 (x — 3 ) 

(*) a + In (x — 1) (ii) x + In ——j (iii) %x 2 

x+\ 


{iv) — 


+ In (x — 1) (v) \ In ( x 3 + 1) 


f i\ ( x ~ l ) 2 , 1 _, 2a: -h 1 

(w) i ^ -—7- H- t— tan - 


x 2 + * + 1 ^3 


x /3 


+ In (* — 1) 


Exercise 9.7 


1 


2 


3 


4 


5 


6 


7 


(0 * 5 («) 7 * ? («0 “ (w) (*0 3^ 3/2 (w) s(* + l ) 5 

(w '0 " 4(2, + ! } 2 (™) fV(3v - 2) («) - l) 6 

(*) ~3 ( t 1 ^ (**) i (* 2 + !) 3/2 (*») -7(1 - a ; 2 ) 

(0 2 In * (ii) 3 In (x — 3) (iii) —In (1 — u) (iv) \ In ( v 2 — 1) 
M “"3 (1 *“ * 3 ) (w) ln (x 2 + x) (vii) je 2x ( viii) £ e 3x 

10 * 

(ix) —\e x (x) In (1 + e u ) (xi) -—— (xii) In (In a:) 

In 1U 


x 

(0 2 sin 2 a: (ii) —2 cos- (iii) | sin 3 0 (iv) — cos 3 0 

(v) T 2 sin 30 + f sin 0 (vi) — \ cos 0 + cos 30 (vii) y sin 2 a; 

(viii) \(x - sin x) (ix) j sin 2 x -f \x (a:) 2.>/(sin x) (xi) In sin 0 
(xii) —In cos 0 

(0 tan* (ii) tan* — * (iii) sec 0 (iv) — cosec 0 (v) — cot 0 

(vi) —cot 0 — 0 (vii) | tan 2 0 (viii) | sec 3 0 (i*) —y In cos 2* 

(*) £ tan 2 * + In cos * (xi) 2 In sin { x (xii) — £ cot 2 * — In sin * 

* 2 * 3 * 

(0 sin -1 - (ii) ^ sin -1 — (iii) ^tan _1 2x (it/) ^tan -1 -^- 

(v) In [a + 7(1 + « 2 )] (vi) In [u + 7(« 2 — •)] 

(wV) j[sin~ 1 a: + *7(1 — * 2 )] (viii) — i(l — x 2 ) 3/2 

(ix) £ tan 1 — (*) £ In (4* 2 + 9) 

5 


(i) x sin * -|- cos * (ii) — * e~ x — e~ x (iii) —jt cos 3 1 + \ sin 3 1 
(iv) 2* sin * — (* 2 — 2) cos * (v) * sin - 1 * + yj (1 — * 2 ) 

(vi) * In * — * (vii) 0 tan 0 — In cos 0 (viii) (y* 2 — jx + y) e 2 * 
(i*) — | (sin t + cos f) (*) yg £ 2 ' (2 cos 3/ + 3 sin 3 t) 


(xi) ^* 3 (3 In * — 1) (xii) 


(0 


10 x (* In 10-1) 
(In 10) 2 


0 1 («) In 2 (iii) In (3 + 2 yj 2 ) 
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8 (t) jti (ii) In 2 (m) ^7; In ( 3 + 2 V 2 ) 

x * - 1 . x — 1 , 1 x , * 

9 (»') ln 7TT (m) ln ~r~ (m) n ~7~ + x (u ’ } n V(* 2 + !) 

'(v) In (1 + e x ) (vi) x — In (1 + e x ) (vii ) 2 tan 1 x 2 

(rin) x (In *) 2 — 2# In x + 2 x 

(:ix ) tan -1 x — \x 2 + g In (1 + x 2 ) 

(x) tan" 1 tan 0 (xi) sec" 1 * (xii) sin" 1 x + J(l - x 2 ) 

v\ ' 18 

10 (i) ±ln2 + ire («) i (»**') i(« + 2 ) ( ,p ) ^ 105 

r 

(ui) g(2 £ 3 + 1) ( viii ) ■f 7 C + 2(**) 3^3 ^ 2 ^ 


(**) 


271 

373 


(xii) j7i — 1 


Exercise 9.8a 

2 «*1 + y - 0 <»> s “» (i “ ) “ 1 w, i" 


(i,) x 2 0-2x^ + 2y = O («) 


dy 

dx 


d 2 * 

d* 2 


= x 


Exercise 9.8b 

1 (i) y — A e x (ii) y = Ae~* x2 (Hi) y = x 2 + A (iv) y = 

(v) y = Ax (vi) 2x 2 + y 2 = A 


2 


3 

4 


(0 ?=--> 

d* x dx y 

(a) *=-i, 

dx dx 

dy 2 y Ay x 2 2 _ j 

( “‘ ) 5 - T’ £ " ‘ + 2s 


- 2 * 


( ”> f-- 2s> 

<•> I" "»• S“J »’- 2 ' + ^ 

dy 2 y dy_ * 2 9 2 _ j 

(vi) — =-> 3 - = 3 -’ * - ^ 4 

dx * dx 2 y 

(i) sin x = 2 sin y => y = sin " 1 (5 sin x) 

(ii) y = 2 tan " 1 |x 

5 = ^4 57 000 6 100 years 
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7 160 000 km 

1 


8 x = 


26% 


10 


1 + 9(f ) 2 " 3 
9 171 years, 141 years 

dO 

= a — b0: 3.76 min (a = 21.3, b = 0.170) 


d / 
d M 


11 a ~dT + bM = ^’ 21.2 days 

12 (i) = —g — A :?; 2 (ii) x = In 


1 , 5 + ku 2 


(iii ) 

13 ^ = 


1 

2 k 

10 


, ku 2 
In ( 1 +- 

£ 


(«0 


10 


8 k g 


2 A: £4 kv 2 

14 132 min 



Exercise 9.8c 

1 (i) y — A e~ x 4 x — 1 (ii) y = x (In x 4 A) 

(iii) y = e x + Ax 2 (iv) y = A e~ x 4 x 2 — 2x + 2 

2 (i) y = A(\ - * 2 )~ 1/2 - 1 («) (1 + x)y = ^4 ** - x + 

3 A e~ % 4 £ (sin / — cos /); ^(sin t — cos /) 

4 2«"‘(1 - O 

5 (i) e Rt,L ; A e~ {Rt/L) 4 ^ 2^2 s ^ n ^ — ^ cos 

LE 0 


R 2 - 1 - co 2 L 2 


(i? sin co/ 4 coL cos co/) 


(«) A e t/RC 4- , ^^ 2^2 (mRC sin co/ — cos cot) 


coRCEq 
1 4 co 2 R 2 C 2 


1 4 co 2 R 2 C 2 
(coRC sin co/ — cos co/) 


Exercise 9.8c! 

1 (i) y = Ae 3x + Be x (ii) y = A e 5x + B e~ x 

(iii) y = A e 2x 4 B e~ 2x (iv) y = A + B e 4x 

(v) y = (A + Bx) e 2x 

2 (i) y = Ae x + Be 2x + 3 

(ii) y = A e x + B e 2x + i(2* 2 + 6* 4 7) 

(iii) x = A e~ f + B e~ 2t + ^ / 

(tfl) a = 4 e~ l 4 B e~ 2t 4 yo (sin / — 3 cos /) 

(i) y = ,4* 3x 4 - i* 4 f 

(«) y = 4 

(m) * = 4 f 4- B e 2t 4 to (sin / 4 3 cos /) 

(w) 0 = 4 e~ 2t 4- 5 4 jo (3 sin / 4- cos /) 


3 
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4 (,•) y = e 2x + 2e x (ii) x = e 2t - e " 2 ‘ (iii) 0 = + <T 2 ' - 2 

5 2 (e~ 2t — e~ 3t ); rad (x 17°) when t = In 1.5 (a 0.4 s) 

6 q = Ae- 200t + Be~ 1000t + -± 


Exercise 9 . 9 a 

b 

1 - coth 6 

a 


3 

4 

5 

6 


2 cosh 2 x 


x sinh x 


v) coth * 


(z;) tanh * (vi) — sechxtanh* (vii) — cosech x coth x 
(viii ) cosech x 

(i) cosh # (ii) \ sinh 2 x (iii) tanh x (iv) In cosh x 
(: v) x sinh x — cosh x (vi) x tanh x — In cosh x 
(i) 2 sinh 1 = 2.35 (ii) \n (sinh 2 + 2)= 8.837 


x 3 x 5 

(0 * + 3! + 5! + ' 


2 4 

.... , * X 

<»> 1 +JT + I7 + 


Exercise 9.9b 

1 (i) In 2 (ii) In (2 +■ J 3 ) (iii) | In — - 

1 — x 

/•x .1-1* ....v 1 . , -1 2* .. . . , . 3 * 

2 ( 1 ) sinh - (a) cosh - (in) ^ sinh — (iv) cosh — 

2 3 3 2 

(v) cosh -1 (x + 1) (vi) sinh -1 (x +- 1) 

(vii) #sinh -1 x — yj ( 1 + x 2 ) (viii) * cosh -1 x — yj(x 2 — 1) 

(ix) ^{xyj(x 2 + 1 ) +- sinh -1 x] (x) ^XyJ(x 2 — 4 ) — 2 cosh -1 ~ 

3 (i) sinh -1 2 = In (2 +- y/ 5 ) = 1.44 

(ii) cosh -1 § — cosh -1 1 = In 0 + = 0.96 

(Hi) V 3 - 2 cosh -1 2 = y /3 - £ln (2 + y/ 3 ) = 1.07 
(iv) iV 3 + 4 sinh -1 2 = \yjb +- \ In (2 +- yjb) = 1.48 

1 x 

4 (i) -tanh -1 - 

a a 

y|,a„hVW; J( j) 

Exercise 9.10a 

1 (i) ^(10^/10 - 1) (ii) In (1 + 72) (m) 2(72 - 1 )a 

(iv) a (72 + In (1 + 72)) 

2 
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Exercise 9.10b 

1 (i) \nc 2 ( e 2 — e~ 2 +4) (ii) ^na 2 (2yj2 — 1) (iii) -*yna 2 

(iv ) ^ na 2 

Exercise 9.11 

1 (i) na, \na 2 , (f) = \n + 9 (ii) 8 a, %na 2 , (p = ^(n + 0) 

2 a 2 , (j) = 20 

3 

4 ia 2 ,ia(V2 + In (1 + 72)) 

Miscellaneous problems 9 

1 (i) 2 , 2.25, 2.37, 2.49, 2.59, 2.69 

8 (if/) ^/tC 


Exercise 10.1 


1 (i) —3 — 6 i («) — 2 — 82 ( Hi) 10 + 5 i (iv) 11 + 2 i 
(v) -13 - 19i ( vi) -3 - 4i, -7 + 24*, -527 - 336z 
(t ii) —20 — 20? (viii) —11 —2 i (ix) 61 (, x ) 50 

(xi) ( x 5 — 10 * 3 + 5x) + i( — 5x 4 + 10 * 2 — 1) 

(xii) (x 6 — 1 5x 4 y + 15* 2 y 4 — y 6 ) + i(6x 5 y — 20x 3 y 3 + 6 xy 5 ) 


(xiii) 


1 + i 1 — i 
50 50 


2 (i) --- (ii) 0.2(3 + i) (iii) 0.02(1 - 7i) (w) 1 + 2i 

5 

(v) —2 (vi) 0.008( —11 + 2 i) (vii) 1 + 2 i (viii) cos 0 + i sin 

3 p = Ik, q = 3k (k e Z) 

4 (i) (x 3 - 3xy 2 ) + i(3x 2 y - y 3 ) (ii) (x 2 - y 2 - 2ixy)(x 2 + y 2 )" 

(“) (* + ?f?) + ■'( s “ ?T?) 

5 (4 4- 5i)(2 — 3i); 83 2 + l 2 = 71 2 + 43 2 

6 — 1 + 2i 7 -rr, f\ 

8 x 2 = |(1 + V2),y 2 = R-l + 72); ±(3 — 2i) 

9 2 — i io 1 , 2 

11 (i) — 2 + 5i (ii) l|- + 2 i (iii) i( —1 ± yj 2) 

(w) ?(±7 7 - i) (v) -2, -4 ± 75i 
(vi) 0.1(1 ± 7l9i)(2 - i) 

12 (i) z 2 — 6 z + 4 = 0 («) z 2 + 6 z + 10 = 0 

(iii) z 2 — 4z + 7 = 0 (iv) z 2 — (1 + i)z + (2 — i) = 0 

13 (*) 5, 7, +11 

(ii) — 1 , 7 , — 13; z 2 + z\ < 0 => z l5 z 2 complex, but not 
conversely 
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14 (») £(1 - ? +l )(l + i) ( ii ) 0 . 2(1 - 2*)[1 - (- 2 «H 
(m) Rl - (n + l)i" + «i" +l ] 

15 A = B = -j! 

Exercise 10 . 2 a 

3 z, + z 2 = 3 + 2*; z, - z 2 = 1 - 4»; z,z 2 = 5 4 - 5i; 

^• = 0.1(-1 - 7i); — = 0.2(-l + 7i); z\ = 3 - 4i 
z 2 z, 

4 iz = —12 — 5i; z — 1 = —6 + 12 i; 3z = —15 + 36i; 

— S — 12* 

— 3z = 15 — 36*; z ” 1 =-——; z 2 = -119 - 120*, 

169 

zz' =169 (on real axis) 

6 Line segments AB and CD are equal and perpendicular 
8 — 2 + i, 6 — 3i 9 —4 4 4i, — 2i, —5 — i 

10 6 + 2i, 7 — i; or 0, 1 — 3i 

11 4 1 \i\ —5^ — 4 \i 

12 (3 - 2i) ± V 3 (l + 0 


Exercise 10 . 2 b 

i —2 — 2 i 5 c = —a(D 2 — bco; c — —cud — bco 2 


Exercise 10 . 2 c 

1 (i) 1 cis tc (ii) y/2 cis h) (iii) 2 cis 60° (it;) 1 cis ( — 90°) 

( v) 13 cis (tc 4 - tan -1 (vi) yj 13 cis (tc — tan -1 1.5) 

(wi) 1 cis (|-Tt — a) ( viii ) 1 cis (a — \ tc ) 

2 (i) 2V3 + 2t («) -5 («) 5^2(1 - i) 

(iv) —8 cos 15° 4 8 i sin 15° (v) 2 cos 80° 4 2i sin 80° 

(vi) 3^2(1 - i) 

3 (i) = 5^/2 cis (tc — tan -1 7) (ii) a/b = j^/2 cis (—ire) 

(lit) b/a = ^2 cis^rr (iv) a 2 = 5 cis (tan ” 1 f) 

(v) £ 2 = 10 cis (tc — tan -1 f) (vi) a 2 + b 2 = 5^/5 cis (tc — tan ” 1 2 ) 

5tc 

4 a" = cis frcw; = 2 n cis (— ^htc) ; ab — 2 cis —; 
a t . 11tc 

i = ?C1S l2- 

3tc 

5 cis (0 4 0); cis — = — i; cis 240° = — \ — ^>/3t 

6 (i) cis (4 jtc) (ii) cis (±frc) (Hi) 5 cis (tc 4 tan” 1 £) 

(it;) yj2 cis ^tc or cis (— ^rc) 
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7 (i) z 2 + z + 1 = 0 (ii) z 2 — 2 (cos 36°)z -1-1=0 

(m) z 2 — 2 (cos a)z + 1=0 

11 Apollonius’ median theorem 

12 (i) circle centre i, radius 2 (ii) perp. bisector of — 1, 2 i 

(Hi) interior of circle centre —3 i, radius 2 (iv) portion of plane 
inside circle centre O, radius 4 and outside circle centre 2, radius 1 
(f) ellipse foci +3, major axis length 10 

13 (i) region below real axis (ii) region outside circle centre O, 
radius 3 and inside circle centre 3, radius 3 

14 First circle entirely inside region between concentric circles 


Exercise io.2d 

1 (i) 20 cis 230° (ii) cis ( — 9 — (j>) (in) 1 (iv) cis (— I^tt) = —i 

(v) 2yj2c\%~ (vi) cis = —i (vii) 2(1 + cos a) = 4 cos 2 ^cc 

(viii) 4 sin 2 (— cos + i sin jS) (ix) cis ( — 36) (x) 2 cis 6 

(xi) cis (a + — 2 y) (xii) tan \6 (sin 6 + i cos 6 ) 

3 (i) 2 cos TfO cis j>9 (ii) item ^9 (Hi) ^ sec 9 

,. x i cos 9 . , t _ , . 

(«0 -I . n = * tan (£0 + 47T) 

1 - sin 9 


5 


Rewrite k = 


c 

~b 


a 


a 


6 

7 

8 


a — c a c AC A'C' 

—c = —~ LC ’’ LCmi BC = ¥c'' 
similar in same sense 
Use AABC similar to ABCA 

See no. 5 : l = (1 - k)b + kc, etc. Add three equations 


A s ABC, A'B'C' 


Exercise 10.3 

3 zfz 2 = r 1 r 2 cis ( — 0! + 9 2 ) 

4 z = z* cis 29 => arg z = 9 or 6 + n 

5 (i) 8zz* + z + z* - 1 = 0 

z — 1 z* — 1 47r / 

(«) yyjy = (zz* - 1)(3 + V 3 i) 

(z - z*) (1 - V 3 i) 

8 1 + i, I5 9 2 — i, \ 

10 (z + 2)(z 2 + 4z + 8); 2 cis n, cis 

11 cis (± 60 °), 1 (repeated) 

12 2 ± i, —2 + i 13 1 ± i, 2 ± ? 

15 OA 1 BC => b - c = Aia (A € i?): conjugate and eliminate A 
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17 


1 

b 

b* 


= 0 


Exercise 10.4a 

1 (i) cis (-72°) (ii) cis220° (iii) 10 8 cis (4 tan -1 *}) 

(it-) cis 80° ( v ) 2 7/4 cis ( —52j°) (t>i) 32 cis = 32* 

(t-ii) 2 20 cis 120° {viii) 2 ,/6 cis 15° (w) V 2 cis (-30°) 

(x) cis (50 — j- 7 t) (xi") sec" 0 cis n(j 7 t — 0 ) (xn) cis a 



4 

5 

6 

7 


-110°, -110°, -110°; cis (-94°), cis (-22°) 
2 ,/12 cis (7±° + 60£°), k = 0, 1, 2, 3, 4, 5. 


(i) 1 , cis (±fw) (it) - 1 , cis (+|rc) 

(iii) cis 50°, cis 170°, cis ( — 70°) 

32c 6 - 48c 4 + 18c 2 - 1; 64c 6 - 80c 4 + 24c 2 - 1; 


5 1 - 10i 3 + t 5 
1 - 10i 2 + 5< 4 ’ 


32c 5 - 32c 3 + 6c 


8 

10 

11 

12 

*3 

*5 


z + z 1 = — 1 => z = cis (+120°); z" + z " = 2 cis (nl20°) 
2" cos" ^0 (cos jnd + i sin |«0); sum = 2" +1 cos" cos \nQ 
tan 15° = 2 - 


tan (12° 41' + *36°) (k = 0, 1, 2, 3, 4) 
%kiz or kn + tan -1 —j- 

V2 

cot 2 ^ k = 0 , 1 , 2 


Exercise 10.4b 

1 1 + 3i, -3 + i, -1 - 3i, 3 - i 

2 (i) 2 cis 0(0 = 20°, 140°, 260°) 

(ii) 72 cis 0(0 = -16°, 56°, 128°, 200°, 272°) 

(iii) cis (-20° + £45°) A = 0 , 1 , 2 ,..7 

(it-) cis (10° + £40°) k = 0, 1, 2,..., 8 
(v) 2 3/4 cis (-45° + *90°) k = 0, 1,2, 3 

(t-i) 2 1/6 cis (25° + £60°), k = 0, 1, 2, 3, 4, 5 

(t-ii) cis (221° 4- 90°£) k = 0, 1, 2, 3 

(wit) 2 1/8 cis (8£ + 3) k = 0, 1, 2, 3 
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*2.71 Jc 

3 (0 1 ± \J*$h —2 (ii) z — cis^-> k — 1, 2, 3, 4, 5, 6 


ii 


3k 




(m) cis ±- > cis ±— (iv) cis , A = 0 , 1 , 2 , 3 , 4 



(v) yj5 cis 0, 0 = \(n ± tan 1 jj + fwA), A: = 0, 1, 2 


,,. 4 — 5 cos a — 3 i sin a 2nk 
4 (,) --a- 0,1, 2, 3, 4 

(ii) z = ±i, 0.1(3 - i), -\i, 0.1(-3 - i) 

(iii) 2 1/6 cis (± 15° + *120°), k = 0, 1, 2 

2 nk 

(iv) cis — (k = 0, 1, 2, 3, 4); cis (\n 4- \nk), k = 0, 1, 2, 3 
7 z 2 — 5z + 7 = 0; z 2 + z + 7 = 0 

9 Condition for equilateral triangle (cf. Ex. 10 . 2 b, no. 5 , Ex. 10 . 2 d, no. 7 ) 
cos 50 + 5 cos 30+10 cos 0 


10 (i) 


16 


10 — 15 cos 20 + 6 cos 40 — cos 60 

(11 ) - 

v ; 32 

35 sin 0-21 sin 30+7 sin 50 - sin 70 

(m) - 

V ' 64 

(iv) sin 4 0 cos 6 0 = 2“ 9 (cos 100 + 2 cos 80 — 3 cos 60 


600 — 45 sin 20 + 9 sin 40 — sin 60 
192 


— 8 cos 40 + 2 cos 20 + 6 ) 
; 3n x 2 ~ 9 


12 ^sec a[l — (— l) n cos 2 n(x]; |sec a(— 1)" +1 sin 2 not 
sin (n + 1 )jol cos n\<X 


13 


14 cot 0 


2 sin |a 


[n tan \n<x + cot j(x — (n + 1 ) cot (n + 1 )|a] 


= / 


cos 0 cis 0 
1 — cos 0 cis 0 


Exercise 10.5 


1 


4 

6 


(i) i (ii) —1 (m) £ -1 i (iv) 2 i sin 1 

(v) sh 2 cos 5 — i ch 2 sin 5 (vi ) cis 3 ch 1 — i sin 3 sh 1 

(vii) In 8 + i(2k + \)n (k e Z) (viii) In 8 + i(2k + %)n (k e Z) 

(ix) In 2 + i(2k + ^)n (#) In 2 + i(2k — f) 7 T 

— 2 cos 3x + 3 sin 3* —3 cos 3x — 2 sin 3x 

e -; e 2 - 


(0 

(ii) 

(iii) 


sin x ch y + i cos x sh y; r 2 — ^-(ch 2 y — cos 2 x) 
cos x ch y — i sin x shy', r 2 = -|(ch %y + cos 2 *) 
sin 2 x + i sh 2 y 2 ch 2 y — cos 2 x 

cos 2 x + ch 2 y 9 ch 2 y + cos 2 x 
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7 — tt7i + i In (3 + 2 -y/ 2 )( + 2 Ttk) 

8 sinh 2 y — jyJ 3; cos 2 x — ^/3 
q iv = £ z ; cosh z 

2. v n * 4x - 1 

io tan 2 y = --5-2 5 tanh = 4x , T 

1 — u — v z * + 1 


where £ 4x = 


(1 — m 2 — v 2 ) 2 + 4v 
[(1 — u) 2 + v 2 ] 2 


12 


13 


( i) e 4x (A cos 3 x + B sin 3*) (ii) (-4 cos x + 5 sin x) 

(iii) A cos 2t + B sin 21 + l-j (tv) ^4 cos 3£ + B sin 3t — 7 sin 4^ 
(i) 3* 3 ' sin t (ii) |(1 — cos 3/) + § sin 3* 


Exercise 10.6 

1 (i) translation 

(ii) half-turnabout 1 — 2 i 

{Hi) enlargement x 5 in point —\i 

{iv) enlargement x2 and rotation +771 about O 

(v) spiral similarity about O, x 5 through ( — tan 1 I7) 

(vi) spiral similarity about O, x 5 through 100° 

(vii) enlargement x pinpoint 2 

{viii) spiral similarity x^2 through —\n about O 
{ix) rotation through +\n about —7 — \i 

2 (i) z' = z + 3 i {ii) z' — 3z cis 135° 

(Hi ) z' - i = 3yj2{ — 1 + i)(z - i) {iv) z # = -iz + 2 

(v) z' = 2 + 7 *' — z (vi) z' = 4z — 6 — 3 i {vii) z! = 3z — 2i 

{viii) z' — 3z — 6 i (ix) z' = — 2iz + 1 4- 2 i 

(x) z' = — 2iz + 4 4- 3 i 

3 (i) Rotation +77C about point 1 (ii) reflection in line y = x — 1 

5 (i) Reflection in real axis (ii) reflection in y = x 

(iii) reflection in y = x tan a (iv) glide reflection in line^ = x + 1 
{v) glide reflection in y = * — 1 

Exercise 10.7 

1 (i) Circle centre O, radius 2 (ii) imaginary axis (iii) y + x = 0 

(iv) real axis 

(v) portion of line y = 1 — yj 3 x in first and fourth quadrants 

(vi) ellipse, foci O and 1 — i (vii) circle, diameter 1,-1 
(viii) circle, diameter f, I7 (ix) line segment joining 1 and —i 

2 2 

(x) ellipse — + — = 1 
4 3 

2 (i) Interior of circle (ii) exterior of circle 

(iii ) circumference and exterior of circle, centre — 4i, radius 3 

(iv) exterior of circle, centre 3 — i, radius 2 

(v) interior of ellipse in no. 1 (x) (vi) all points for which y > 0 
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4 
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io 
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The locus of points whose distance from — 1 and from — 16 are 4:1 
has centre O, radius 4 

M = i 

3zz* H- 4zz — 4zz* +4 = 0; 

zz*( 1 — A 2 ) + z( — a* + A 2 b*) + z*( — a + X 2 b) + aa* — A 2 bb* = 0 


(0 

(«) 

(zzz) 

(«0 

(») 

(w) 

(z>ZZ ) 


half-circle z = 3 

semicircle z = 2 from — 2 i, — 2 , 2 i 
semicircle z — i through — 1 , z, 1 
the same moved + 2 in direction of real axis 
semicircle |z| = 2 through — 2 i, 2 , 2 z 


whole circle 
whole circle 
(viii) quadrant of circle |z| = 1 
Semicircle |z| = 1 through 1 , z, 


\z\ = 1 , beginning and ending at 
z + 11 = 1 


1 


(z) the same translated + 1 in direction of imaginary axis 
(zz) the same enlarged x 2 in origin 
(zzz) the same rotated +\n about origin 

(iv) the circle |z| = 1 anti-clockwise from 1 to 1 , thence along 
real axis to O 

(v) the semicircle |z| = 1 clockwise through 1 , — z, — 1 , thence 
along real axis to — go , and from + oo to + 1 

(z) From 0 to 1, thence along parabola y 2 = 4 — 4x to z, thence 
along y 2 — 4 + 4x to — 1, thence to 0 
(ii) from 1 to e, thence round circle |z| = e to e cis 1 , thence along 
radius to cis 1 , and finally along circle |z| = 1 to 1 


from 1 to cosh 1 , thence along ellipse 
ch (1 + «), thence along hyperbola 


ch 2 

J2 


+ 


y 


sh 2 1 

„,2 


= 1 to 


cos 2 1 sin 2 1 


= 1 to 


cos 1 , finally along real axis to 1 


Exercise io.8 

i Imaginary axis with y > 0 2 Anti-clockwise along \w\ = 1 

3 |z + 4 — 3z| = 5; 3x — 4y = — 24 

4 Coaxal system of Apollonius circles based on the points 1 and — 2 i ; 
the origin side of the line 2x + 4z/ + 3 = 0 

8 x = constant circles centre O, radius e x 
y — constant half-lines, arg w = y 

10 2 — z, — 1 + 2 z 

11 Three quarters of the circle x 2 + y 2 — x + y = 0 , starting at O, 
passing through 1 — z, and finishing at 1 

12 Confocal hyperbolae and ellipses 
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Miscellaneous problems 10 

1 a = + 2, b = ±3; (z 2 — 4z + 13)(z 2 + 4z + 13) 

3 Rotation through + 0 about c\ half-turn about yj3i 

4 (i) 2 + 3 i, ± 1 

(ii) the points of |z| = 2 in the third and fourth quadrants 


5 (z) True (ii) false, e.g. a = 0, b = 1, r = i 
(iii) true, cf. Ex. 10.2a, no. 7 

6 (z‘) r = 1 ;/(z) describes line segment [—1, 1] 

(in) 9 = 0 , real axis, * ^ 1 

9 = 71, real axis, # ^ — 1 

9 = +2 71 ? imaginary axis 

7 If U, P, Q represent 1, z, z 2 , and if Z_UOP = 0, then UP and PQ are 
inclined to real axis at angles 20, 30 (As OUP, OPQ are isosceles) 

71 2.7Z 

8 cos 2 — , cot 2 — roots of 5z 4 — 10z 2 + 1=0; 2 , 

5 5 

11 x e 2nk 



12 

13 


x = constant hyperbolae 

y = constant ellipses 

cos 2a sin 2a^ 
sin 2a — cos 2a ) 


Exercise 11.1a 

1 (0 i( a + 6) («) 4« + (i«) 4« + 46 (iv) fa - 

(y) —fa + |6 


Exercise 11.1b 

1 (i) |(6 + c), f(c + a), f (a + b) (ii) f(a + 6 + c) 

2 (i) |6 + fc (ii) {a + fc (Hi) fa + |6 
(iv) (v) (vi) fa + f6 + fc 

3 (i) —26 + 3 c (ii) fc — fa (iii) —a + 26 

4 r = f(l — t)a + jtb + jtc, 
f(a + 6 + c) 

5 A = i^ = f 


Exercise 11.2a 

1 (i) f(6 + c + 4 ) (ii) f(a + 6 + c + d) 

(iii) f(a + b + c + d) 

3 (i) fa + f 6 + fc (ii) (f, f, f) (iii) - 0 . 4 i + 2 . 1 / — 0 . 3 fc 

4 12 cm 5 y,¥ 
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Exercise 11 . 2 b 

i (0 (If, 2f) (?'?') (lH, Iff) (iii) (5f, 0 ) (iv) (f h,\a) 

a (i) (If, 0 ) (ii) ( 1 ||, 0 ) (iii) ( 6 , 0 ) (iv) (\h, 0 ) 

3 x — fa 5 j^na 3 , fja 6 x = 2.71 

Exercise 11 . 3 b 

1 (i) 3, 18° 27' (ii) 1,70° 32' (iii) -4, 161° 31' 

2 (?) 2i + 2k, 3 j + 2k, 66 ° 55' 

(ii) 2j + 2k, -M + 3j, 64° 54' 

(iii) — 2i + 3j - A, -2i - 3j + A, 115° 23' 

(w) 3/ - k,j - 2k, 45° 

3 Lines meet at (2, 1, 1), 70° 30' 

4 (if) 

11 - 2tt, - 2*! 

5 «i - *1 
\ — 4 + «, + 

Exercise 11 . 4 a 

1 (?) r = 2Ai +j + 2(1 — A) A; x = = - 

(ii) t = At" + (1 + X)j + (2 + A) A ; x — y 1 — z 2 

2 (?) 2x + y + 3z = 13 (??) x 4 - 2y + 3z = 8 


3 (?) 61° 53' (??) 38° 13' 

Exercise 11.4b 

1 (?’) i + 2j + 3A, i-j- k, 128° 6' (ii) i - j,j + k, 120° 

2 (?) 45° 35' (ii) 38° 34' 

3 (?) 3, 1 (??) lx + 56 y — 5 = 0 and 16* — 2y + 35 = 0 

4 (?) 6, 3 (ii) 3x + 3y + 4z + 6 = 0 and x — y — 8 = 0 

5 (?') 35° 17' (ii) 54° 43' (Hi) if 

6 V 5 7 3 

8 2* + 2y — z = 9, (4, 1, 1) 

Exercise 11.5 

1 i - j,i + j,-i - j + 2k 

(: i) i - j, i + j, i-j + 2k (ii) 90° (iii) 54° 44', 90° 

(iv) 39° 14', 39° 14' 



2 
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Exercise 11.6 

1 (i) r = e 9 (ii) 2 e 2t , 2 e 2t ; 0, 4 e 2t 

4 — 20 ^ 2 , 10 ; ~ 20 / 2 , 2k, 10 


Exercise 11 . 7 a 

1 (i) 3 x 10 4 Ns (ii) 10 5 N s (iii) 0.75 Ns {iv) 2.7 x 10 " 23 Ns 

2 (0 300 Ns (ii) 12 Ns («*) 80 N s (iw) 2 Ns 

3 (; i ) 2 000 Ns (ii) 2 000 N s {in) 2ms -1 

4 (i) 1 000 N s (ii) 1 000 N s (fit) 1 ms -1 

5 (i) 667 N s {ii) 667 N s (tit) 0.667 m s " 1 

6 (i) -40 000 Ns (ii) 260 000 N s {in) 26 ms -1 

7 (i) -20 000 Ns (ii) 280 000 N s (iii) 28 ms " 1 

8 (i) -20 000 Ns (ii) 280 000 N s (iii) 28 ms " 1 

9 4 Ns, 200 N io 400 N 

11 (i) 0.001671 « 0.005 N s (ii) 0.005 N 

12 50 N 13 1 500 N 

14 50 N, increase 

15 7.2 x 10 6 N s, 7.2 x 10 6 N s, 7.2 m s ' 1 

Exercise 11 . 7 b 

1 (i) 12t + 4j + 2k (ii) —3t — 9/ + 6 A: (m) 2i + 8 fc 
(iv) t 4- 1.23; 

2 (i) 10* + 5; + 3A: («) ft — f/ + 5A: (m) 5t + 3/ + 4A: 

(iv) 4.5t + 3.6 7 + 2k 

3 (i) —2 t + 10A: (it) — 2.4t — 1.6/ + 1.6/r 

4 10»N 
36 bv 36 

Exercise 11 . 7 c 

1 (i) 2i + 3/ + k (ii) ft + ^/ — k 

-( 1.8\ ?2.4\ 

2 (0.8/ ± V8.4/ 

3 (600, 400, 20) 4 1 600 N 

5 ^ km s _1 , 245 km s _ 1 ,5s 

6 (i) 30°, 8 v (ii) 25.2 mv 

Exercise 11 . 7 c! 

1 if, 31ms ' 1 2 3.6 ms - 1 ,0.4 

3 -f, +2 ms -1 4 1 ms~\f 

5 10, 7.5 ms -1 8 0.952,40.7 m 
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Exercise 11 . 8 a 

1 (i) 300 J (it) 5.48 m s“ ‘, 4.27 m s " 1 

2 8 000 N, 32 000 N 3 3.2 x 10 4 N 

4 100 J, 0.064 

5 (i) 4 x 10 5 J (it) 5 000 N (iii) 10 000 N (iv) 20 000 N 
mv 2 

6 — 7 2.4ms-'; 0 . 6 J 

8 2.1,2.6 ms -1 ; 0.45 J 
g 1 . 8 , 2.8 m s - 1 ; 0 
10 (i) 18.2 N (ii) 6.6 N 

Exercise 11 . 8 b 

1 (i) 8 N (ii) 1 N (iii) 0.2 m (») 0.05 m 

2 (i) 0.16 J (ii) 2.5 x 10 -3 J (iii) 4J (iv) 0.25 J 

3 (i) 2 x 10 3 N m -1 (ii) 0.004 J 

4 (i) 5 x 10 4 Nm -1 (ii) 2.5 J 

5 (i) 4 m (ii) 400 J (iii) 800 J 

6 (i) 200(2 + x) (ii) 25x 2 (iii) 400 + 200* - 25* 2 , 9.7 m 

7 4.06 x 10 4 J, 15 m, 32.8 ms -1 , 53.9 m s -2 

8 (i) gR (ii) }gR 

9 y/2gR (« 11.3 km s -1 ), ^7? (« 8 km s -1 ) 

Exercise 11 . 8 c 

1 (i) 8 J (ii) — 4J (iii) 0 

2 (i) 3.6 x 10 5 J (ii) 45.9 m 

3 (i) 9.9ms -1 (ii) 10.8 ms -1 

4 (i) 10 ms -1 (ii) 500 J (iii) 5.1m 

5 (i) 3.13 ms -1 , 0.39 N (ii) 4.43 m s - \ 0.98 N 
(iii) 3.83 ms -1 , 0.73 N 

6 (i) 784 J (ii) 6.26 ms -1 (iii) 1 180 N 

7 11 ms -1 

8 (i) y 2 = 2 ag(l — cos 9) (ii) mg( 3 cos 0 — 2 ) (iii) cos -1 

Exercise n. 8 d 

1 2.26 kW 

2 9.5 kW 3 0.039 ms -2 

4 (i) 1.47 ms -2 (ii) 1.47 ms -2 

5 4.9 x 10 4 W 

6 2 160 N, 1 700 N, 80 m 

7 25 N s, 25 N, 0.85 m s -1 


W|N> 
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8 —- kg s 1 , i> Mgv W 
v 

P l2Pt ... l3Px . 2 l2Pt 3 

0 (i) a = — (ii) v = /- ( 222 ) ^ = /- (w) * = - /- 

y V ' mv K V m \J m V m 

Fu 

10 mv + At/ 2 = F (v ^ U), mv =-Af 2 > £/); 

t; 

[f 3 [fu 

^j-(F<kU 2 )^—(F^kU 2 ); k = 0.5; 
4.98,4.78,4.38,2.35 ms" 2 ; 10.6 s 


Exercise 11.9 

1 ( 2 ) if = — 4x ( 22 ) 3.14 s (m) * = 3 cos 2^ (it/) 6 ms " 1 

3 (0 2 («) 6.28s (iii) | (it/) 0.13 

4 ( 2 ) 0.63 s (ii) 0.68 s 


7 (i7c + ^V^ 

8 mg(l — cos wf) 

v . 11 


V(*o” 




/j 4 T q ml 

10 6 =-- 0, 71 I — 

ml v T 


Exercise 11 . 10 a 

1 ( 2 ) i + k (ii) — 5i + 3/ — * ( 222 ) 2 2 — j 

(it/) -lit + 9/ - 5A: (v) -4i — 3/ + A: (vi) —12* - 67 - 12* 

3 ( 2 ) Force 61 + 3/ along 3* — 63 / + 4 = 0 ( 22 ) Couple 4* 

( 222 ) Equilibrium ( 22 ;) Force 2 i + 15 7 along 15* — 2y — 2 = 0 

4 (30/*, 30/*) along jf — y = force (30P, 307*), couple IO 0 P 


Exercise 11 . 10 b 


1 

2 

3 

4 
8 


10 


700 N, 800 N; 75 kg 

(i) 0.213 N (ii) 0.106 N 

|fF(4x — # 4 - 6 ),jkF(— x + 4^ + 6 ) 

1 li kg, 85 - kg 5 \W cot J? 6 40° 

(i) W 7 tan 15°, tan 15° as 0.27 (ii) ^W 7 , tan 30° as 0.58 


(in) IW 7 , 


f/ from A 


1 


0.29 


2V3 

5 mg/ 2 mg/(18a — 5/) 
6a ’ 


12a 
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2d — l cos 6 

11 -;--- 

l sin 6 

12 (i) 2yJ\3P,6x — 8# = 5a ( ii ) Pa (Hi) 6 Pa 

Exercise ii.ua 

2 ft + j, towards O 

Exercise n.nb 

i %M(a 2 + b 2 ) 2 \Ma 2 , \Ma 2 , 2 Ma 2 

3 \Ma\ \Ma 2 4 § Ma 2 ,\Ma 2 

5 fMa 2 , §A/a 2 

6 (?) ToMa 2 (ii) — (a 2 + U 2 ) (iii) — (3a 2 + 2h 2 ) 


Exercise ii.iic 


(*') 

12 kg m 2 s~ 1 ; 

36 J 

(ii) 

0.48 kg m 2 s~ 1 ; 

0.96 J 

(iii) 

0.24 kg m 2 s~ 1 ; 

0.48 J 

(iv) 

96 kg m 2 s~ 1 ; 

288 J 

(v) 

168 kg m 2 s' 1 ; 

877 J 

(vi) 

7.15 x 10 33 kg m 2 s~ l ; 

2.60 x 10 : 


2 (i) 2F/ma; (ii) ma(o/2F ; (iii) maa> 2 /4F 

3 5.82 N, 1.8 s, 58.2 N 

4 62.8 rads -2 , 126 rad s -1 , 78.5 N 

5 3.65 kg m 2 

6 (i) mga, J3g/2a, 3g/4a; (ii) 2 mga, yj3g/a, 0 

7 38.7 rad s -1 

8 (i) i mgl sin 9; (ii) 9 = sin 6; (iii) 9 = 

(iv) 2nj2l/3g 


9 

10 



2 t iy/5a/3g 


(ii) 


2 Mg 
2 M + m 


Exercise 


1 (i) ML'T ’ 2 (ii) ML ’ 1 (m) ML ’ 2 (iv) T " 1 (i;) T ’ 1 

(vi) T ’ 2 (vii) 1 (viii) ML 2 (ix) MT ’ 1 (x) M - 1 L 3 T ’ 2 
(xi) MT ’ 2 (xii) MT ’ 2 

2 (i) T oc (ii) A cc ^ (iii) » <x Jt {iv) /oc 

(v) V cc 7^ 


3 T = kv 2 pA; M _ 1 LT 2 
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Miscellaneous problems 11 

3 {a 2 - 2 l 2 ) 


from common base 

4(2 a + 3/) 
cos 2 P + cos 2 y — 2 cos a cos p cos y 


7 Distance 

9 


sin 2 a 


)u 


dv 

10 — = — cv + g 
a t 

r = - (ct + e~ ct — \)g + - (1 - e~ 
c c 

11 A = 18* + 3/; B = —2 i — 3/ 

^Fsin a g\ * gt 

12 y -X-iT + T') { '-’ >T 

13 Ap( 1 — k)u 2 ; 4.5 ms -1 

15 3 h 35 min, 6.19 x 10 6 Ns, 1.03 x 10 3 N 

la 

16 71 - 

I g 



3m’ 3 m 


19 P = + 2^A) x 10‘ 


6 = 


Av 2 


10 /’ 


dV 


20 M _ 'L°T; — = -XApgV 
a t 


Exercise 12.1a 


I 

1.5, 0.75 

2 

(i) 2.92 ( ii) 5.83 

(: Hi ) 1.46 

3 

4.5, 8.25 

4 

5.96, 1.52 


5 

aXL 

*36 

6 

1.94, 1.44 


7 

1.73, 3.13 

8 

5.58, 0.60 


9 

3.61,6.69 

10 

£2.18 


11 

29.75, 230; 

0.5 



12 

P(T= 30) 

= P>P{T = 

45) = 1 - p; 45 - 

\bp,22bp{\ 


p(T = 30) 
40 - 

= b,P(T = 

35) =hp(T= 45) 

= i(i -/>); 
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Exercise 12.1b 


2,2 


3 36, 


5 

7 

9 

10 

11 

12 


2 - t* 

48 
yjn 

0.59; 7,6.3; 8.26 

(?') 0.96,0.88 (ii) 1.58 
8, 7.92; approximation np\ 

(i\ (,7\ 11. 20 40 

n 27 W 27 > 3 j 9 


2 (?) 2,1 (?’?) 18,3 (??'?) 50,5 

3 3^2 

4 2 4 

6 4.49, 2.47 


1 - 01 % 


3 j 9 

(~t 4- -/ 2 4 !/3\5. 1 121 

' 3 l T j j 2435 243 

*2 

12, 60 


r 6 


(6 — 5/) 2 5 

13 (0 (i* 4 + Ir 1 ) 25 , 0, 10 (ii) 16.67, 11.78 


Exercise 12.2a 

1 a = 2 ; — + —; (0 A (*0 f 

2 X = (?) 0.001 (ii) £ 

3 (0 i (ii) §; 


4 

5 

6 


O(a) = 

V 6 

l - 3. 

/I — 4 , 



0(0 = 



* < 0 

0 ^ a: ^ 3 

3 ^ ^ 4 

4 < * 


x < — 1 
-1 < x ^ 1 
1 ^ x 


(?) ii (ii) ®(*) = U(2 + 3* - * 3 ) 

,1 

co x < 0 

0>(x) = j i(l — cos nx) 0<*< 1; (?) \ (ii) \ 

ll 1 < x 

1 


2 000 


*~ r/200 °; ^ 0 .135 ^-) 0 145 (•••) 0.OI8 


Exercise 12.2b 


1 11 
65 36 

2 11 
35 18 


5 o 


1 1 

2 ’4 


2 3 

25 4 

0 ,i 



6 

7 

8 

9 

10 

11 


0 M = 


l 


for a ^ x ^ b, and zero elsewhere; 
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a + b (b - a) 2 


3/4 _ 3 _ 
20 


b — a 

a = ? 2, f 

3 
2 

o 1 ^ In 2 1 
2 + j> 2 H j 

<£(/) = I0.135; 5 000 

c = h ; L53a; ^ 0-53 (”) 1; k = L28 


12 


ia (0 A = - (?'?') { (ii) j 

n 

13 X, X 


Exercise 12 . 3 a 

1 (?) 16 (ii) 92 (Hi) 33 (iv) 23 

2 (?) 6.7% (?■?') 10.6% (?■?•?') 0.6% 

3 (?) 0.319 (??') 0.145 (???) 0.030 (iv) 0.537 

4 0.753 5 (?) 6 («) 210 (???) 34 

6 66.77,2.28; (?) 7.9% (?'?') 66.2 cm 

7 3.63, 1.71; 2 (since x takes only integer values) 

8 (?) 0.675 (?'?') 1.65 

9 (i) 9.1% (?■?) 37% (in) 530.8 

10 63 g 

11 (?) 54.6 min (ii) 44.7 and 49.3 min (??? ) 39.4 and 44.7 min 

12 (?) 5.1% (??) 103.26 mm (???) less than 1% exceed 107 mm 

13 1.004 cm, 0.020 cm 

14 14.6, 5.2 months 

15 74.35,3.23 

16 7.52 a.m. 

17 (?) 8.48 a.m. (??) 5 min, 118, 4 entrances 

18 7.9%, 15.5%; 24.45 mm, 6.1% 


Exercise 12 . 3 b 

1 (?) 0.018 (ii) 0.729 (???) 0.067 

2 (?) effectively zero (??) 0.999 (???) 0.89 

3 (?) 0.052 (?'?') 0.198 4 (?') 0.131 (?'?') 0.063 

5 (?) 0.222 (?’?) 0.018 (???) effectively zero 

6 (?) 0.827 (ii) 0.062 (???) 0.021 

7 (?) 0.228 (??) 0.051 
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8 O') 7.8% (ii) 95.8% (iii) 0.8% 

9 n = 610; 0.09 


Exercise 12.3c 

2 0) 0 . 5 ,0.1 (ii) 0.5,0.05 


3 

4 



Exercise 12.4 

1 (i) 0.368 (ii) 0.368 (iii) 0.080 

2 (i) 0.330 (it) 0.865 

3 (i) 0.018 (ii) 0.195 (iii) 0.156 (iv) 0.0003 

4 (i) 0.165 (ii) 0.269 (iii) 0.373 

5 (i) 0.003 (ii) 0.037 

6 52 

7 228, 211, 98, 30, 7, 1, 0 (to the nearest integers) 

8 Theoretical frequencies: 3, 9, 16, 17, 14, 9, 5, 2.5, 1 , 0.4, 0 —close 
correspondence with observed frequencies, suggesting lucky and 
unlucky tenants 

9 2, 1.97; so n « a 2 , as in case of Poisson distribution 

10 50 

11 1.05; 0.350,0.367,0.193,0.0675,0.0177; 105; 27 

13 (i) 0.997 5 (ii) 0.849 


15 20,60,90,90,67,40,33; (i) £157.50 (ii) £167.50 (iii) £160.75 

16 (i) 0.406 (ii) 0.184 (iii) 0.37 

Exercise 12 . 5 a 

2 (i) aE[ X -] + bEly~], a 2 V[x] + i> 2 F[y] 

(ii) aE[x] 4- a 2 V[x] + 2 ab Cov [x, y] + b 2 V\_y~\ 

3 (i) aE[x\ 4- bE[_y] 4 - cE\_z] (ii) a 2 V[_x~] 4- b 2 V\_y] 4- c 2 F[z] 

4 10.5,8.75 5 10.5, 14.58 6 0,2.42 

7 Bob has the advantage; Alan’s variance =11.67, 

Bob’s variance = 11.25 

8 12.25 

9 (i) 6 f (ii) 13^ (iii) 26§ (iv) 33-j 

10 60 minutes, 3.16 minutes; (i) 0.005 7 (ii) 0.057 (iii) 0.565 

11 (i) 0.168 (ii) 0.027 
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12 EVi,V(E^); 150 s, 8.124 s (i) 0.7% (11)11% 

i i 

13 0.638 

14 Viz] = [(A,®, + A 2 6,) 2 + (A,a 2 + A 2 i 2 ) 2 ]<r 2 

Exercise 12.5b 

1 (1) 0.057 (11) 0.57 2 (1) 0.072 

3 0.048 4 (1) 0.115 

5 (1) effectively zero (11) 0.998 7 

6 0 . 001 , suspicions justified 

7 136 8 96 

Exercise 12.5c 

1 58.5, 31.7 2 5.08, 0.55 

3 (1) 1.79 (11) 1.64 (111) 1.61 

4 (11) Z A, = 1 

1 

5 V[x] = —; £[j 2 ] = a 2 ; A = —-% 

n m + n — 2, 

Exercise 12.6a 

1 Significant at the 1% level 

2 Significant at the 5% level 

3 (?) Significant (n) Not significant (Hi) Significant 
(iv) Not significant evidence against the hypothesis 

4 Significant at 5% level that techniques are effective 

5 Significant at 1% level that support has decreased 

6 (i) Not a significant result 

(ii) Significant at 1 % level that claim is not justified 

7 ( i ) Significant (ii) Not significant evidence against the hypothesis 

8 Significant at 5% level that cards were not drawn at random 

9 Consistent with hypothesis; significant at 0.1% level that hypothesis is 
incorrect 

10 (*) 0.000 4 (ii) 0.043; not significant at 5% level that germination 
rate is less than 75% 

11 Significant at 1 % level 

12 Highly significant, almost conclusive evidence 

13 (1 - X ) n ; (0 0.107 (ii) 0.028 (iii) 0.001 

Significant at 0.1% level that hypothesis is incorrect 

14 0.109; (i) 527 (ii) 537 

15 475 


(ii) 0.165 
(ii) 0.008 
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Exercise 12 . 6 b 

1 (0 58.8,67.2 («) 58.0,68.0 (m) 56.5, 69.5 

2 (0 1 150.1, 1 153.9 (ii) 1 149.6, 1 154.4 

3 (0 154.6, 163.6 (it) 153.2, 165.0 

4 (i) 7.35, 7.49 (it) 7.32, 7.52 

5 (i) 54.1,60.7 (ii) 53.05,61.75 

6 Highly significant that there is bias; 0.2 to 0.6 

7 1.996 5,0.064 5; 0.006 5; 1.98 to 2.01 

8 No; yes 

9 #1, *1 ± 0.776; *, * + 0.163 


Exercise 12.7a 

2 y = 0.58* + 8.4 

3 w = 0.367" + 40.4; 14^=58 

4 y = 1.4a: — 26.3 

5 (i) y = 0.28* 4- 69.8 (ii) y = 3.6* — 36.4 

6 y = 33.6* + 8.1 

7 y — 1.24* — 24.4 

8 F — 0.9/ — 6.3; 9 

9 y = -1.28* + 20 ; 16.2 

i° (i) y = 0.8* + 1.5 (ii) y = 1.9* — 3.1 

11 19.14, 15.03; y = 0.33* 4- 8.8 

12 log jo Y = 0 . 021 X - 37.2 

13 (i) 15.2, 1.72 (ii) 149 (iii) 2.3 (it;) a — 1.5, b = 0.015 


Exercise 12 . 7 b 

2 0.88 just significant at the 0 . 1 % level 

3 0.99 highly significant 

4 0.92 highly significant 

5 0.28 not significant 

6 0.72 just significant at the 1% level 

7 0.23 no significant correlation 

8 0.35 

9 correlation coefficient of —0.84 significant at the 1% level 

10 0.65 11 0.43 12 0.65 

13 (i) 0.61 significant at 5% level (ii) 0.45 not significant 

(iii) 0.93 significant at 0.1% level 
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Exercise 12 . 7 c 

1 p = 0.66, t = 0.47 not significant 

2 p — —0.08, t = — 0.02 tastes quite dissimilar 

3 (i) 0.79 (ii) 0.61; both significant at the 1% level 

4 0.90 5 0.22 

6 p = 0.79 significant at the 1 % level 

7 (i) p = 0.92 (ii) t = 0.78 

8 (t) t= —0.71 (ii) not quite significant 

9 ACBDE, ABDCE, ABCED; 120; & 

10 p = 0 . 74 , t = 0.62 significant at the 1 % level 

11 t = 0.67 significant at the 1 % level 


Miscellaneous problems 12 


kr-k 



3 (0 


9! ’ 8 ’ (k 

(1 -k)( 1 + i) 


(«) 


1 )! 

(1 - *)(1 + 20 


4 

5 


1 + (1 - k)t 

(?) | (??) 3f min 

(?) 68 (?'?) 404 (?'?'?' ) 471 


1 + (2 - k)t 


' n 
\ r 

36, 180 


/(l -P) n ~ r -,e- 


m*- 


-kp 


nl 


r\ 


8 50^2 + 16.3cr 


9 (0 I («) if 

10 (i) — np(\ — p ) («) X = 1.96^/w (m) 0.077 

11 (i) p(A x ) + p(4 2 ) + p(A s ) - p(A { A 2 ) - p(A 2 A 3 ) 

- 

lP(Ai) -ZZp(^) 

i 

(«) 1 - “ + H-h ( — 1)" ~y, which tends to*" 1 


12 

13 

14 
*5 


p(f + 5f) = (1 - 0&)/>(O + ctdt(l - />(*)) 

x , /fl — 

(?) />„ = i (ii) p, 


1 r 

b n = 2 1 + 


0.621 

/27V - r - 1 
^ “ ( N - 1 


d b 
2N 


2\2N 


2N~ 1 


A - 17 2 
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Exercise 13 . 1 a 

621 46 33 

66 41 12 

59 51 4 

\108 45 12 


1 041 

22 

19 

26 

265 

1 061 

22 

21 

18 

010 

, 987 

18 

35 

18 

090 



Exercise 13 . 1 b 



/ 24 

4 

1 050 

3 

20 

51.0 

1 | 

32 

0 

1 507 

5 

32 

47.1 


^28 

9 

967 

5 

19 

50.9 

2 i 

b = 

3a; 

d = 

5a; 

/ = 

2 a; 


columns 1, 2, 5 are 
linearly dependent 


h is independent of the others, including £ 

3 b = 3 a; c = — 2 d; 3 a + 3 d — f — 0 ; a — 2 d — e = 0 , etc. 

4 col. 2 = — 3 col. 3; replace 5 by —4 

5 row 1 — row 2 — row 3 = 0; 2 col. 1 + col. 2 — 7 col. 3 = 0; 

col. 2 — 7 col. 3 — col. 4 = 0, etc. 

6 x = — 30; yes: 8 col. 1 + 6 col. 2 — 47 col. 3 = 0 

7 H 2 0 + 0 ^ 14220 ! j contains same as 6C 2 H 4 0 2 



(~\ o\ /o -1\ / 0 1> 
V 0 - 1 / b 0 / V-i 0 ; 
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2 


3 

7 


8 

9 

10 

11 



x and y stretches; enlargement and half-turn; enlargement x 5 and 

rotation; Ar-shear; y-shear; reflection in y = — x and enlargement; 

2 0 \ . . 

(x-shear and enlargement); reflection in 


2 -1 
1 2 



1 -i 


0 1 7\0 2 


y — x tan ^a; reflection in line y = x tan (45° 4- |-a); rotation +40°; 
all points map into line y = ^x ; (*, y) (#, x) lying on y — x 
Reflections in y = 0, * = 0, y — +x; rotation through ^nn, (n e Z) 



all lying on line 2 y + * = 0 



0 

-1 


(parallelogram) 


vectors in direction (1,1) unchanged; 
vectors in direction (1, —3) x 25 


shear in line y = x 


cos 2a sin 2a 
sin 2a— cos 2a 
0 0 



2-1 2 

2 2 — 1 > 

/ 0 —y/2 -J2\ 

lix) i I V2 1 — 1 | and inverse 
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'5 0 o\ /1 0 o' 

0 5 01 (xa) Jo 0 0 

,0 0 5/ \0 0 1, 

2 ( i ) Reflection in y = 0 (ii) orthogonal projection on to z = 0 
(Hi) 120 ° rotation about x = y = z ( iv ) quarter turn about x-axis 

(v) x and y stretches, z reflection 

(vi) enlargement x 3 and central inversion 

(vii) rotation through tan -1 f about z-axis 
(■viii) rotation through tan - 1 f about #-axis 

3 -1,0, 1, 1, - 6 , -27, 1, 1 

5 All points have images in the plane x 


2y - z = 0 


-1 

°\ 

/1 

-1 

1 

1 

1 

-1 

-1 

-1 

0 

0 

1 

1 

-1 

1 

-1 

1 

-1 

-1 

0 

-1/ 

\i 

1 

1 

-1 

-1 

-1 

1 

-1 

H 

-1 

1 

-1 

1 

-1 

1 




1 

-1 

1 

1 

1 

-1 

-1 

-1 



\-i 

-1 -1 

1 

1 

1 

-1 


f 



8 Glide reflection in line y = x ; + + translation; + — and — -f glide 
reflection; — — half-turn 


Exercise 13.3a 


AB = 


6 1 
7 2 


BA = 


BD = 




DB , DA , AC, BC impossible 


4 AB = BA = 



a b \ ( lb 
ka kb ) \ — la 



2 1 3—1 4 ^ 

5 _7 -4 -1 3 

-8 -11 -1 5 0 ) 


5 (* 2 * !) 


= (0 0 0) (or transposed) 
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55 1 


10 


a ka'' 


a 

[ ~l ~ a 


a b 


where be = 1 — a 2 


\ c a j 

7 /, A, A 2 = B 2 , A 3 , AB, BA, A 3 B, B 3 A ; twelve in second group 

Ik 0 0 \ 

8 Only scalar matrices: I 0 k 0 I commute with all square matrices 

\0 0 kj 

9 Multiplication of complex numbers: (a l + ia 2 )(b i + ib 2 ) 

1 0 0^ 

0 1 0 

0 0 l y 

'—14 — 10* 21 - III s 

-21 - lit -14 + 10iy 


— 14 — 10^2 27 - 11^2 \ 

-27 -11^/2 -14 + 10^/2/ 


ii 


' 13 -21 
. —21 34 


90 -144 
■ 144 233 


etc. (Fibonacci) 


^ where r = s = (p — ^ (a — d) {b ^ 0 ) 


13 C 2 = 


1 -1 

1 0 


c 6 = 


'-1 o s 
o ~i / 

14 a = l, b = l, c=— 1 , rf = 4, e — 0 ,f = 3 , g = 2 , h = — 1 , k = —1 
16 cos 6 = £ x x x 2 A/(£ x i £ x l) 


'l 0 g\ 


2 0-3 


17 (x y 1) 0 1 f\\y ; (x y z) 0 4 15 


U / c i 


\-3 \b 



19 Clockwise rotation of quadrilateral about O through ^7 1 

20 Five points in three dimensions rotated through 120° about line 
x — y — z 


Exercise 13.3b 


1 


2 

3 



A, quarter-turn; B , reflection; C, enlargement; D, shear 
P, 120 ° rotation about x = y — z; Q half-turn about z-axis; R , re¬ 
flection in plane x = z; S, enlargement x 2 

R X R 2 = ^ 2 ^ 1 > formulae for cos (a + /?), sin (a + /?) 
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4 


6 

7 

9 

io 


*3 
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^3 0 \ / — 1 0 


0 3 , 

0 -3 
-3 0 

COS gTT 
I 


13 


0 -1 
0 

-1 
sin 3^71 
Sin j7l — COS j}7Z 

(cos a — sin a 
l sin a cos a 


0 -P 



enlargement and half-turn 
enlargement and reflection 
enlargement and reflection 


tan a = ^ 


shear and enlargement 
x- and y-stretches and quarter-turn 

shears and y-stretch 



1 
1 

/-I 
0 - 
0 

/-I 
0 
0 

10 1 
1 0 
\0 0 

Suspect 120° rotation about line 2x = y = 2z 
Vectors (0 0 z) 

e.g. permutation matrices, rotations, etc. 

/»' 

e.g. from no. 7 , M = N 2 , X = ( 0 

\ l J 

Must be an orthogonal matrix, with unit determinant (see Ex. 13 . 4 a, 

no- 13 ) 

/+1 0 0 ^ 

Composition of a permutation matrix with one of I 0+1 0 

\ 0 0 ±h 

Reflection, rotation and translation +k parallel to ^-axis 


Exercise 13.4a 


C -!"! 


0 0-2 


oi _1 
4o 9 


1 
2 2 


-2i 


4 ad ^ be 
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where A = a 2 — be , etc.; A = E tf 3 — 3flta = 0 when a + b + r = 0, 
or a = b = c 

- 0-0 

n M 2 = /, see Ex. 13 . 3 a, no. 6 ; e.g. half-turn, reflection, etc. 

/ —25 4 3 \ 

12 M' 1 = I 26-4 —3 J(M') -1 = (AT 1 )' 

\ —33 5 4 / 

/ cos a cos P — cos a sin P sin a\ 

15 | sin a cos P sin a sin p —cos a I matrix orthogonal ; rotation 


— sin p cos P 0 
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Exercise 13.4b 

2 t, 2f+l 3 -2 and 4-5 

4 Non-trivial solution when a x b 2 — a 2 b x # 0 


Exercise 13.4c 

3 Using row 1 on p. 421 

(*) DTQARODR.SPWEOYjEAjKGNQIWFBBFG 
(if) COATISX,GZLDEAZSOWN.RMEJWDAKCDNVTP 
WTMFAUO.RXF 


4 



E.GHEE QKF A 


Exercise 13.5a 


1 

2 


(0 


-5 

b 

b n 


(ii) 54 (iii) 0 (iv) 2 670 (v) 2 232 (vi) 0 (vii) 0 
has factor a — b 


3 

4 

5 

6 
8 


(i) 4 ab (ii) 1 (iii) cos 29 (iv) 1 (v) — 1 — t 4 

(vi) (ps - rq)(x - y)z 


e.g. 


= A x (ps — qr) 


t a + qb ra + sb 
'C + qd rc - f sd 

a y + x b + xy 
c q p d p q 
A = — 1,7 7 x = 0, — 1 

* = — 2 i, y = \ (ar - cp) 2 = (ir - - bp) 


b 

d 


Exercise 13.5b 

1 (i) -1 («) -3 (iii) -21 (iif) -19 320 (t>) 0 

2 A = a 3 4- b 3 4 - c 3 — 3 abc, A, A, —A; a 3 4 - £ 3 ; 1 - 2 * 3 4 - a : 6 

4 See alien cofactors, 13 . 5 , p. 000 5 —2, 8 , 9 

6 Any pair of rows or columns are linearly dependent 

7 (i) 2 abc (ii) 0 , (iii) 0 

8 (i) 1 4- a 2 4 - b 2 4- c 2 4- 2 abc = 0 (ii) be 4- ra 4- ab 4- 1 = 0 

9 Si 


\P q r 

0 p q 
I a b c 
0 a b 
0 0 a 


s 0 
r s 
0 0 
c 0 
b c 


= 0 


11 
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Exercise 13 . 5 c 

1 (i) 276 (ii) -ab(a - b){ 1 - a)(l - 4) 

(m) (a + 4 + c) 3 

(iv) — (a + b + c)(a + boo + cco 2 ) (a H- boo 2 + coo) 

2 (a + b + c) (a + bco + cco 2 ) (a + boo 2 + coo) 

= (a H- b + c) (J] a 2 — X be) 

3 (a — 1 )(a 2 + cl + 1 ); (1 + a + a 2 ) 2 (l — a ) 2 ; 0 

4 (i) (b — a) (a — c)(c — 4) (ii) (a + b + c)(b — a) (a — c)(c — b) 

( Hi ) ^ be — ^ a 2 (iv) (b — a) (a — c) (c — b) (be + ca + ab) 

(v) 0 

5-1 6 ( 2 a + b) (a - b) 2 ; ~ ( 2 a + b) (a - b ) 2 ; 0 

0 c b 2 

7 (i) c 0 a = 4a 2 b 2 c 2 ] (ii) (b — a) 3 (iii) —8 

b a 0 

(iv) (a+ b-\-c + d) (a — b + c — d) (a+ b — c — d) (a — b — c + d) 

(v) 0 

1 2 3 4 I 


8 e.g. 

Exercise 13.5c! 


2 3 4 5 

3 4 5 6 
1111 


1 e.g. D'D = 


a 2 + c 2 ab^r-zti: 
ab + cd b 2 4 - d 2 


2 e.g. 


1 0 
0 1 


+ 


1 1 
1 1 


2 1 
1 2 



a 2 + be 

ac c 2 


a 2 -b c 2 ab ac 

3 

ab 

be + ac be 

- 

ab b 2 + c 2 be 


ab 

ab ac + b 2 


ac be a 2 + b 2 


a p 0 

2 



4 

b q 0 

= 0 




c r 0 




5 

a + xb+yc + z\ = 

\a b 

< c\ + \a b z\ + \x b c\ ■ 


+ a y z\ + \x y c\ + \x y z\ 


9 Must have value of determinant = 1 

Exercise 13.6a 

1 ( n ) 


— — —-— — —— = no solutions 


(iii) 


-21 -14 0 

y -1 


In + mp mn — Ip l 2 + 




556 ANSWERS TO EXERCISES 

2 (0 3, —5, —1 (see 13 . 6 , Example 1 ) 

(ii) infinity of solutions: 3 x eqn. 1 — 2 x eqn. 2 = eqn. 3 (see 13 . 7 , 
pp. 445 - 6 ) 

3 (br - cq) 2 + (ar - pc) 2 = (, aq - bp) 2 

5 a + b + c = 0=>x = y = z; or a = b = c=>x + y + z = 0 

6 A = 3, — 1 , — 1 ; k = 3, x:y:z = 1:0 : 1 ; 
k = —1 ,x:y:z = 3:4:—9 

7 3 x = a + b + c; 3 y = a + boo 2 -f coo; 3 z = a + boo + coo 2 
9 p = -1 (A, - 3 k, k + 3 ) 

10 k = 1, no solutions; k = — 2, (X, 5X + 8, — \ — 2) 

Exercise 13.6b 

15, 10 f r . 

1 x = 2 + —> y = —— 2 k = -5, c = -i 

3 (i) A = 0, x\y\z = 1:0:1 
(ii) values other than 0 or 1 : 

2 A - 3 1 _-A 2 + 4A - 3 

* ” 2 A(A - 1 )’ V ~ 2(1 - 1)’ Z “ 21(1 - 1) 

4 ( — 3, 6 , 2 ), ( 2 , 1, 6 ); (sin a sin /?, —cos a sin /?, cos a cos /?) 


Exercise 13.7a 

1 (t) No solutions (ii) — 4j, 19, 7 ^ (iii) — 1 , 1,1 (it;) 0 , —- 2 

-106 111 109 
65 ’ 260’ ^6 

2 — 1, —1,1 3 x:y:z:w = 1: — 2:0:1 

/I 2 0\ /5 0 0\ 

4 e.g. M 2 = JO 1 O] M, = I 0 1-3 1 det4 = 46 

\0 01 / \0 0 1 / 

5 a: = 5.23, y = -6.91, 2 = -2.33 

6 x = 0.898, y = 0.763, z = 0.618 


Exercise 13.7b 

1 (t) 


«*(;=?) 


(it>) singular (v) 



/n 

2 

" 5 \ 

/ 72 

-32 

“ 4 

0 

4 

( mm ) tm( 132 

-37 

\-i 

2 

-1/ 

\—60 

70 



2 
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(*) No solutions {ii) — 4^, 19, l\ {Hi) —1, 1, 1 

/ ■ \ 106 HI 109 

W — 65 j 2605 26 

( 6 0 — 2 \ 

2-1 l); -2,3*,8{ 

0 -1 3/ 

6 k = 2; * = 6 — 5z, y = 3z — 2; 

A = 2 + 2z, 7 = 8 - 8 z, Z = 8 - 9z 

y (z) 5£, —12/, t {ii) no solutions {iii) 5t + 5, —12 1 — 10, t 
{iv) no solutions 

8 All points map into the plane 2x + y — z = 0 


l dl 

*1 

Cl \ 

2a, 

<N 

1 

-b a 3 — 0 

° 

-2 

~ l \ 

9 ( *2 

b 2 

C 2 I 

where 2b, 

<N 

I 

+ b 3 = 0 , e.g. 

1 

0 

° 

\«3 

h 

c 3 

2c, 

— 4r 2 

+ < 3 = 0 

\* 

4 

2 / 


( 5a 5 b 5c\ 

3 a 3 b 3 c j 
2a 2b 2c) 

11 All 2x2 minors zero; rows and columns linearly dependent in pairs 


Exercise 13.8 


1 


e < 3 - 2^/2; 


'l - e 1 + e 01 


no, let M 1 = jl 1 + « 1 — « 0 I = 2 


0 


0 


'1 1 1 \ -1 1 0 \ 
1 1 0 I + e I 1 -1 0 
\0 0 2 / \ 0 0 0 / 


= 2 {A + eB), and similarly M 2 — 2 (C + eD). Work out successive 
products.. . M 1 M 2 M 1 M 2 M 1 . neglecting terms in e 2 , e 3 ,... 


Miscellaneous problems 13 

1 (*> y, 2 ) ' — > (x, -z, y ); {x, y, z) ■ — ■> (z - 1 , y, -x + 1 ); 

( x,y,z ) 1 > (y - 1, -z, -x + 1); 

x ~ y, x — z — 1 , y = 0 , z = 1 . 

Reflections in three planes would have a fixed point 

2 Closure 3 r 2 = s 2 = e\ D 2 , as on p .502 

4 8 6y = 41* + 176 

5 (3/, 3 — 4/, 2 — 5/) through points (0, 3, 2) and (3, — 1, —3) 

7 Let / = {lmri),x= {xyz),a = (0 £ c). Equations state that a = l x x, 
so we must have a, / perpendicular, i.e. al + + cn = 0 ; also a, * 

perpendicular, so 0 # 4- by + cz = 0. A vector parallel to a x / satisfies 
/* + my + tzz = 0 , i.e. {bn — cm , cl — an , am — bl) {l 2 + m 2 + n 2 )' 1 
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8 Least 6 (3 zeros in same row or column); greatest 1 1 (no two zeros in 
same row or column) 

9 (abc)~ l (b - c)(c - a) (a - b) £ a~ r b~ s c~' {r + s + t = n - 1 ) 

io No 11 cos n8 


Exercise 14.2a 

1 No 2 (i) Exclude b + d = 0 (ii) exclude Q = 0 

3 12346 5, 41236 5, 34125 6 , 234156 



{M: det M < 0}, x not closed 
{M: det M = 0}, x closed 

5 No. 120° turns about two altitudes may combine to give a half-turn 

(see 14 . 4 , p. 512 ) 

6 Include 11 for closure. { 1 , 13} 


N 

z z~ 

Q Q + 

R R + 

c 

R\Q 

cis 8 

x + iy: x,y e Z 

+ 1 

1 

1 

1 

1 

1 

1 

1 

0 

0 

1 

- 0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 

x 0 

1 

0 

1 

1 

1 

1 

1 

0 

1 

1 

-r 0 

0 

0 

1 

1 

1 

1 

1 

0 

1 

0 


(zero excluded for x and -r-) 

8 Multiplication closed, addition not closed 



f+g 

f°g 

g°f 

f 

g 2 

(0 

1 

1 — X 

X 


x — 1 

*(1 — x) 

1 — X 

X 

X 

(«) 

In x + sin x 

In (sin x) 

sin (In x) 

In (In x) 

sin (sin x) 

(iii) 

(-3 + 2x - 2x 2 ) 

x + 3 

-x + 3 



2x - 1 

2x + 1 

2x - 1 

X 

X 

{iv) 

4 

9 — 4x 

7 - 4* 

4x — 3 

4x - 5 


10 (i) Yes (ii) no (Hi) no ( iv) yes 



+ 

• 

X 

2D 

1 

0 

0 

3D 

1 

0 

1 


14 (i) Closed (ii ) yes (iii) yes provided x 7 h y 
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Exercise 14.2b 



(0 

(«) 

(iii) 

(») 


{vi) 

(vii) 

c 

1 1 

1 

1 

0 

1 

0 

0 

A 

1 0 

0 

0 

1 

0 

1 

0 


3 (ba)c # b(ac) 4 (be)d ^ b(ed) 7 Associative 

8 e.g. k = — 1 , put x = cot <x.,y = cot 

9 Not associative; (0, 0), (a x — a 2 , a 2 — aJ; no. Invalid because asso¬ 
ciativity assumed. (B ° A) ° ^4* ^ 7? 0 (-4 0 A*) 


Exercise 14.2c 

1 S, 0; symmetric difference: A A B — (A n B') u (B n ^4') 

2 (i) {2, 4, 8 } x (mod 14) closed (ii) 10 

3 (i) 27 + 13 = 0; 12 + 28 = 0 (mod 40) 

(ii) 17 x 13 = 1 (mod 20) (iii) 53 x 17 = 1 (mod 100) 


5 (0 0.04 


'3 -4 
4 3 


(rotation) ( 


. / 0.6 0 . 8 \ 
\ 0.8 - 0 . 6 / 


(reflection) 


(iii) 

(vi) 

(0 

{vi) 

(0 


0 -1 

1 1 

cos 80° 
-sin 80° 
1 


1 — x 
3* - 7 


(») 


(*») 

sin 80° 
cos 80° 
2 



(rotation) 
2{x - 1 ) 


x A- 1 
(vii) 


(iii) 


(iv) x 1/3 (v) log fl * 


+ 1 


5x — 3 

A 2 3 4 5\ /I 2 3 4 5 6 ^ 

\4 1 5 2 3j {) ( 456132 , 

10 Inverse requires a 3 A b 3 + c 3 — 3abc ^ 0, i.e. a A b + c ^ 0 and 
a , b , c not all the same 


(iii) 


1 2 3 4 5 6 7 
4 7 5 1 3 6 2 


Exercise 14.3a 


1 (i) 8 (ii) 4 (iii) 120 (iv) n (v) 8 (vi) 4, 8 , 6,8 

4 (i) Fails for closure, identity, inverse (ii) fails for inverses 

(iii) no inverses (iv) neither inverses nor identity (v) not closed 
(vi) not closed (vii) include identity (viii) include — i 
(ix) not closed 
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5 (0 b 2 = f = e= (bf ) 3 (ii) *(fc) ^ (fli)r 
(m) table (with v and w interchangeable) : 



1 

p 

q 

r 

s 

t 

u 

V 

w 

1 

1 

p 

q 

r 

s 

t 

u 

V 

U) 

p 

p 

q 

1 

t 

U) 

V 

s 

r 

u 

q 

q 

1 

p 

V 

u 

r 

w 

t 

s 

r 

r 

t 

V 

s 

1 

U) 

q 

u 

p 

s 

s 

U) 

u 

1 

r 

P 

V 

q 

t 

t 

t 

V 

r 

w 

P 

y 

1 

s 

q 

u 

u 

s 

w 

q 

V 

1 

t 

p 

r 

V 

V 

r 

t 

u 

q 


p 

U) 

1 

w 

w 

u 

s 

p 

t 

q 

r 

1 

V 



1 

X 

-1 

— X 

1 

1 

X 

-1 

— x 

X 


-1 

— x 

1 

-1 

-1 

— x 

1 

X 

— x 

— x 

1 


-1 


7 24 

8 T(x) = —x + b (reflection in %b) 

10 e.g. G = R + , x,g = 2 

Exercise 14 . 3 b 

1 3 2 = 9, 3 3 = 13, 3 4 = 11, 3 5 = 5, 3 6 = 1 (mod 14) 

3 12 4 7 4 = 1 (mod 15); {3, 6 , 9, 12} x mod 15 (identity 6 ). 

5 {4, 8 , 12, 16} x mod 20 (identity 16) 

f 1 1 x 1 ] 

6 <x, 1 - *, 1 -> -’ -t’ (D 3 , as on p. 509 ) 

[ XL— XX— l X) 

7 (0 2 (ii) 2 (m) 4 («;) 10 (0 12 

9 Z> 3 , as on p .509 

10 /, A , AB, ZL42?, ABAB = 5^4^ (Z> 4 , as on p. 511 ) 

11 No finite additive subgroups of C, except {0} 

12 e.g. generated by (1 2 3 4 5 6 7); generated by (1 2 3 4 5) (6 7) 

J 3 C 9 ; 2 of period 3, 6 of period 9; 

Ci 2 , 4 of period 12, 2 of period 6 , 4 and 3; 1 of period 2 

14 Generated by 2 , 6 , 11 or 7 

15 (o d 2 

(ii) eighth symmetry is central inversion, (x, z) ( — x, —y, —z); 
group is C 2 x C 2 x C 2 . 
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—-V Remove elements having x = 0 
x xj 

Exercise 14.4a 

1 1 <-> 1 i 

— 1 <-► 9 «-► x 2 
{i, -i) <-*■ {3, 7} <-» {x, x 3 } 

3 (i) non-Abelian 3^ Abelian (ii) D 2 , C 4 
(iii) C, + = E 2 , + ^ ^35 + 

(w) impossible to set up one-to-one correspondence between R + and Q 

4 Must be infinite 

6 x (e S) <-> 2nx (e rotations of circle) 

7 C 6 ; {/, g} <->• {r, <} of period 6 

8 11,2,4,5,7,8}; {1,3,5,7}; {1,3,7,9}; {1,5,7,11}; n prime 
io All six permutations of a, b, c are automorphisms 

12 a «-* a r (r = 1, 2, 3, 4, 5, 6); cis <-► cis (r = 1, 3, 5, 7) 


Exercise 14.4b 

1 (i) D 2 (ii) D x (iii) D 4 (iv) C 4 (v) D Q (vi) C 2 

2 



3 Mirrors at —, group is D n 

n 

4 See Ex. 14.3b, no. 10 

5 gf{x) = -few = 2 - \ W)* = l ’ D 4 

6 a 2 — b 2 = (ab) 3 — 1 

7 ar = r 5 6 7 * 9 10 11 a = g\ ar 2 — r 4 a = /; <2r 3 = r 3 a = d\ 

ar 4 = r 2 # = r; zzr 5 = ra = b 

9 Nine of period 2, eight of period 3, six of period 4 

10 24 of each 

11 (i) (ii) (iii) see no. 3 (iv) (v) 
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